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UNDER M.P.E.P. section 708.02, VIII 

1. Petition 

Applicant hereby petitions to make this new application, which has not received any 
examination by the Examiner, special. 



2. Claims 

All the claims in this case are directed to a single invention. If the Office determines that all the 
claims presented are not obviously directed to a single invention, then applicant will make an 
election without traverse as a prerequisite to the grant of special status. 
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Field of search: The following areas were searched: 



Class 178, subclass 67; 

Class 332, subclass 103; 

Class 341, subclass 61; and 

Class 375, subclass 39, 59, 261, 265 and 298; 
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Non-patent literature included: 

"Efficient Modulation for Band-limited Channels", G.D. Forney Jr. et al. 
"Multidimensional Constellations - Part I: Introduction, Figures of Merit, and 

Generalized Cross Constellations", G.D. Forney Jr. et al. 
"A Coherent Digital Amplitude and Phase Modulation System", C.N. Campopiano and 

B.G. Glazer 

"An Efficient Power-Reduction Technique on DSL Modems", H. Kwok and D.L. Jones 
"PAR Reduction Via Constellation Shaping", H. Kwok and D.L. Jones 
"Higher-Order Optimum Hexagonal Constellations", CD. Murphy 

4. Copy of references 

There is submitted herewith a copy of the references deemed most closely related to the subject 
matter encompassed by the claims. 
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Also attached is Form PTO-1449. (PTO/SB/08A and 08B) 

5. Detailed discussion of the references 

There is submitted herewith a detailed discussion of the references, which discussion 
particularly points out how the claimed subject matter is distinguishable over the references. 

Also attached is a copy of the Information Disclosure Statement previously filed with the 
application. Copies of references already filed and of record are not included herewith. MPEP 
§708.02(Vm)(D). 

Applicant claims a quadrature amplitude modulation signal space, comprising a symmetric 
spherical quadrature amplitude modulation constellation in a multidimensional complex plane, 
the constellation bounded by a surface comprising all symbol points at a predetermined distance 
from a center point, coincident with an intersection of at least two axes, and corresponding in 
relative position to the symbol points on opposite sides of the axes. The following discussion of 
the references points out with the particularity required by 37 CFR §1.1 11(b) and (c), how the 
claimed subject matter is patentable over the references. 

U.S. Patent No. 5,150,381 to Forney, Jr. et al. and assigned to Codex Corporation suggests 
constellations that are shaped to be as nearly spherical as possible. Forney further describes the 
implementation of a quasi-spherical constellation utilizing the Voronoi region of a lattice to 
define the constellation boundary. Referring to Fig. 3 of the Forney patent, the quasi-spherical 
constellation described by Forney does not anticipate, suggest or teach the symmetric spherical 
QAM constellation claimed in the present invention. 

U.S. Patent No. 5,515,400 to Arai and assigned to Fujitsu Limited describes a signal point 
arranging method for use with a quadrature amplitude modulator/demodulator device. The 
constellation described by Arai comprises a plurality of concentric circles with their center at the 
coordinate origin of a rectangular plane, each of the concentric circles having a radius equal to 
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that of a previously established symmetric grid, the desired number of symbol points then being 
arranged on the concentric circles. Referring to Fig. 4 of the Arai patent, the constellation taught 
by Arai does not anticipate or suggestion the symmetric spherical constellation claimed in the 
present invention. 

IEEE Journal Article, "Efficient Modulation for Band-Limited Channels", describes several 
QAM constellation schemes well known in the art, including rectangular constellations, cross 
constellation, circular constellations and hexagonal constellations. It can be see from Fig. 5 and 
Fig. 6 of the article, that these known constellations do not describe the symmetric spherical 
constellation claimed in the present invention. 

IEEE Journal Article, "Multidimensional Constellations - Part I: Introduction, Figures of 
Merit, and Generalized Cross Constellations, describes the theoretical advantages of a spherical 
constellation, however a method for implementing a spherical constellation is not described. 

The Henry K. Kwok and Douglas L. Jones articles, "An Efficient Power-Reduction 
Technique on DSL Modems" and "PAR Reduction via Constellation Shaping", describe a 
technique for shaping a constellation into a hypersphere in an effect to reduce the peak-to- 
average ratio. This shaping technique results in constellations with a spherical boundary, but it 
does not provide a symmetric spherical constellation. 

Applicant believes the remaining references to be immaterial to the patentability of 
Applicant's invention. However, a brief discussion of each reference is provided to illustrate the 
field of search. U.S. Patent No. 3,887,768 to Forney, Jr. et al. and assigned to Codex 
Corporation describes a quadrature amplitude modulation constellation comprising signal points 
mapped on the complex plane which are drawn from an alphabet consisting of at least 8 points, 
and are set up in concentric rings each rotated by 45° with respect to adjacent rings. U.S. Patent 
No. 4,894,844 to Forney, Jr. et al. and assigned to Codex Corporation describes a Voronoi 
constellation. U.S. Patent No. 5,606,578 to O'Dea and assigned to Motorola, Inc. describes a 
modulation method utilizing known signal constellations. U.S. Patent No. 5,621,761 to Heegard 
and assigned to General Instrument Corporation of Delaware describes a rotationally invariant 
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trellis coder. U.S. Patent No. 6,026,120 to Betts and assigned to Paradyne Corporation describes 
a system and method for communicating information using circular multidimensional signal 
space constellations. U.S. Patent No. 6,101,223 to Betts and assigned to Paradyne Corporation 
describes a system and method for optimizing the uncoded transmission and reception of data 
using circular constellations. U.S. Patent No. 6,392,500 to McCallister et al. and assigned to 
Silcom, Inc. describes a rotationally invariant phase point constellation utilizing phase point 
voids. IRE Transactions of Communications Systems article, U A Coherent Digital Amplitude 
and Phase Modulation Scheme", describes a technique for a combination of amplitude and phase 
modulation. The article "High-Order Optimum Hexagonal Constellations" by Charles D. 
Murphy describes a method of constructing hexagonal constellations. 

Section 102 of the United States Patent Laws provides in relevant part: 



None of the references obtained in the prior art search disclose or describe Applicant's 
invention. 

Section 103 of the United States Patent Laws provides in relevant part: 



No combination of references obtained in the prior art search teach or suggest Applicant's 
invention. 



A person shall be entitled to a patent unless ... the invention was known or used 
by others in this country, or patented or described in a written publication in this 
or a foreign country . . . 



A patent may not be obtained ... if the differences between the subject matter 
sought to be patented and the prior art are such that the subject matter as a whole 
would have been obvious at the time the invention was made to a person having 
ordinary skill in the art to which said subject matter pertains. 
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Abstract— This paper attempts to present a comprehensive tutorial 
survey of the development of efficient modulation techniques for band- 
limited channels, such as telephone channels. After a history of advances in 
commercial high-speed modems and a discussion of theoretical limits, it 
reviews efforts to optimize two-dimensional signal constellations and pre- 
sents further elaborations of uncoded modulation. Its principal emphasis, 
however, is on coded modulation techniques, in which there is an explosion 
of current interest, both for research and for practical application. Both 
block-coded and trellis-coded modulation are covered, in a common frame- 
work. A few new techniques are presented. 



I. Historical Introduction 



B AND-LIMITED channels (as opposed to power- 
limited) are those on which the signal-to-noise ratio is 
high enough so that the channel can support a number of , t ^ , , . , v ^ , 

bits/Hz of bandwidth. The telephone channel (particularly ture amphtude modulation' (QAM)^ wi^a.;i6-pp^ 

♦u- a-a^*~a v™ u ; „™_u„ u_ constellation to send 4 bits/Hz in a^nominal 24C 



decade. (It was not unknown in this period to encounter 
users who thought that Nyquist or Shannon or someone 
else had proved that' 2400 bits/s was about the maximum 
rate theoretically possibleon phone lines!)).^ V 

The first commercially important 480£ Ht/s mod^ 
the Milgo .4400/48/mttoduced^ 
eluded a manually adjustable equalizer*^ 
nominal 1600 Hz bandwidth in-^njuncjton|^i^^ 
modulation to send 3 bits/Hz. >The?dp^ 
adaptive equalization soon aUowed expansion of the 
nal bandwidth to 2400 :fc, r essenti^ 
line bandwidth. Following a /first? generauW(o|j 
sideband 9600 bit/s modems in?the:late.l960*s, which 
only marginally successful, broad success was achieved 
the Codex 9600C,' introduced in : 1971* ; which used -qu 



the dedicated private line) has historically been the scene 
of the earliest application of the most efficient modulation 
techniques, for band-limited channels. The reasons have to 
do both with the commercial importance of such channels 
and with the fact that they can be modeled to first order as 
linear time-invariant channels, sharply band-limited be- 
tween typically 300-3000 Hz, with high signal-to-noise 
ratios, typically 28 dB or greater. Their relatively low 
bandwidth permits a great deal of signal processing per 
. transmission element, and therefore early application of 
the most advanced techniques, which have often then been 
applied several years later to broader-band channels (e.g., 
radio). ) . 

. \ '. The earliest commercially important telephone-line mod- 
ems appeared in the 1950's and used frequency shift keying 
to achieve speeds of 300 bits/s. (Bell 103), or 1200 bits/s 
von private lines (Bell 202). The earliest commercially im- 
portant synchronous modem was the, Bell 201 
in about 1962, which used 4-phase modulation 
nal 1200 Hz. bandwidth to achieve 2400 bits/s on private 
lines. This remained the state of the art for most of the, 
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4 : bits/Hz* 

bandwidth. (16-point QAM had been used at 4800lbits 
by ESE and ADS about 1970.) Modems 'with' these 
acteristics remained the state of the art.fpr r another^d 
and it was thought by many (includrng^spme 1 of the^R 
authors, who should have^known^b^^ 
modems would never be : widely [ised^cornm 

In 1980, a first ' generation ^o^4|^f|)^|m^^ 
introduced by Paradyne (MP:^|^^Up^e^^§8 
the C^ex/ESE/;sm4fmd|b^^e^a^ 
simply extended 2400 N Hz QAM|modulation|tqi6 
by using 64-pomt si^al; constellation 
, vanced implementation techniqueis and^exploitmgjthe 
ual upgrading of <he' t^oo^^^t^ed^ 
successfully over a high percentagfepf^u^n 
generation of 14.4'kbit/simodem^at^^eri^ 
ing in 1984, coded QAM'modulationji^ 




techniques.' 

This evolution of high-speed' modeins to'cever^mgh 
rates using successively more ^comphcated^Jmod 
schemes is summarized m : Table'l;|along^wim |th^| 
tion and year of final adoption; of; CCITT|uii^^ 
standards embodying these * schemes!t;Howtfar^^a 
evolution go? History would suggest caution m'stijf^ 
any ultimate ceiling. However, wimout- any' dramatic 
eral upgrading of the telephone network, we venturelto 
that 19.2 kbits/s is the maximum conceivable rate 
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TABLE I 
Modem Milestones 



■if:. 



■err 



YEAH 


.,, M00EL 


SPEED 


BANDWIDTH 


n 


CONSTELLATION 


COMMENTS 


CCITT STANDARD 


1062 


Ball 201 


2400 


1200 Hz 


2 


4-phaee 


fixed eq. 


V.26 (1966) 


1967 


MQgo 4400/48 


4600 


1600HZ 


3 


frphase 


manual eq. 


V.27(1972) 


1971 


Code i 9600 C 


9600 


2400 Hz 


4 


1&OAM 


adaptive eq. 


V.29 (1876) 


1960 
1981 


Paradyne M PI 4400 
Codex/ESE SP14.4 


14.400 
14,400 


2400 Hz 
2400HZ 


6 

6 


64-QAM 
640AM 


rectangular 
hexagonal 




1064 




.14,400 


2400HZ 


6 


120OAM 


6-state trellis 





3iV.i*v .'- ■ ;tL .X'jt Vr., -tV. * .•" * . J« ; '* . 



pulses/sec. 

' LPF: low-pass filter; bandwidth F/2 Hz 
f ; «c : carrierjfrequency (raldians/sec). 
: s(t): ^IrdslohaJ^lWvV.;^;- ' 1 




tew* 



Fig. 1. Canonical QAM modulator! 



f r;: 



.{..■v.r.i/ii- 



vf: 



n* 

4 



yt 



(x, Vydfc "coordinate 
(n^vnyi):'2-dlmen 

(x f t f/y',): j2-dlmenslonal received point. 



;> u ■' ' (X| Vyj: coordinates of 2-dlmenslonal signal point, 
(n^vrv): '2-dlmensional gausslan noise variable. 



♦ 



Fig. 2. QAM channel model. 



—Jp^f/i^-^.; ....... 

a^none-line modem for general use, even with all-out use 
^Se^most powerful coiled modulation. We shall see. 



vat* r- 



Channel Model and Basic Limits 

J^Srature ^ amplitude modulator ca!n be used to gener- 
^^standard linear double-sideband modulated carrier 
^^(includihg 1 f forms of phase modulation and 
Samplitude i ^modulation), which includes all types of 
^SfeM^iii^^^Juse^ in : synchronous modems. A 
SSSwl XfAM. modulator is shown in Fig. 1. 
Rt ?u1mng ;, thklt^ only channel impairment is Gaussian 
Jpid ^that the /receiver achieves perfect carrier phase 
^S^ihe^miple' model of Fig. >2 -applies. Signals are 
^'pairs (x' t ;y t )\ the channel is essentially two-dimen- 
.JSa^We shall call such a pair a "signal point," imagined 
SEeta a two-dimensional plane. Signal points are sent at 



a regular rate of F points/s where F Hz is the nominal 
(Nyquist) bandwidth of the channel. A signal point is also 
called a "symbol," and the symbol interval is 1/Fs. The 
model indicates that the two signal point coordinates 
(x t9 y t ) are independently transmitted over decoupled 
parallel channels and perturbed by Gaussian noise vari- 
ables {n xn n yt ) y each with zero mean and variance N. 
Alternatively, we could regard the two-dimensional signal 
point as being perturbed by a two-dimensional Gaussian 
noise variable. If the average energy (the mean squared 
value) of each signal coordinate is S, then the signal-to-noise 
ratio is S/N. 

The simplest method of digital signaling through such a 
system is to use one-dimensional pulse amplitude modula- 
tion (PAM) independently for each signal coordinate. (This 
is sometimes called narrow-sense QAM.) In PAM, to send 
m bits/dimension, each signal point coordinate takes on 



"A.N . 



mm 
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one of 2 m equally likely equispaced levels, conventionally 
±1, ±3, ±5,- • •, ±(2 m -1). The average energy of each 
coordinate is then 



and it follows that 



S„ = (4 m -l)/3, 



S m+ i = 4S m + l. 



That is, it takes approximately (asymptotically, exactly) 4 
times as much energy (or 6 dB) to send an additional 1 
bit/dimension or 2 bits/symbol. The probability P(E) 
that either x t or y t is in error is upperbounded and closely 
approximated by the probability that the two-dimensional 
Gaussian noise vector (n xn n yt ) lies outside a circle of 
radius 1, which is easily calculated to be P(E) = 
exp(- 1/2N). A noise variance N of about 1/24 therefore 
yields an error probability per symbol in the range of 
about 6X10 " 6 . 

The channel i capacity of the Gaussian channel was 
calculated in Shannon's original papers [1]. Subject to an 
energy constraint x, 2 < S, the capacity is 

C = (1 /2) log 2 (1 + S/N ) bits/dimension, 

achieved when x t is a zero-mean Gaussian variable with 
variance S. Note that when S becomes large with N 
constant, it takes approximately (asymptotically, exactly) 4 
times as much power to increase the capacity by an ad- 
ditional 1 bit/dimension. Therefore the ratio of bits/di- 
mension achieved by PAM to channel capacity approaches 
1 as S becomes large. This fact was used in [2] to argue 
that coding has little to offer on highly band-limited chan- 
nels. 

We can make a more quantitative estimate of the poten- 
tial gains from coding as follows. Using narrow-sense 
QAM (two-dimensional PAM) to send m bits/dimension 
or n = 2 m bits/symbol at an acceptable error rate (of the 
order of 10~ 5 -10" 6 ) requires an average energy in each 
dimension of 

S=(2"-l)/3 

when N = 1 /24, or S/N = 8 X 2" for n moderately large. If 
channel capacity could be achieved, we could send about 
n' = \og 2 (S/N) bits/symbol, or about n+3 bits/symbol 
at the same signal-to-noise ratio. Thus, the potential gain is 
about 3 bits/symbol or, alternatively, about a factor of 8 
(9 dB) of power savings. 

Many authors (see, e.g., [3], [4]) regard the parameter R 0 
as a better estimate than C of the maximum rate that is 
practically achievable using coding. On the Gaussian chan- 
nel, R Q is [5] 

R o = (1/2) log 2 (1 + S/2N) bits/dimension. 

It thus takes a factor of 2 (3 dB) more power to signal at 
R 0 than to signal at C. The maximum practical improve- 
ment obtainable by coding might therefore be estimated as 
of the order of 6 dB, or 2 bits/symbol (although the R 0 
estimate is not universally accepted). 



In what follows we shall show that simple coding t 
niques gain about 3 dB or 1 bit/symbol, while .theim 
elaborate techniques described have theoretical gams^ 
order of 6 dB or about 2 bits/symbol. This r is en'' 
consistent with the R 0 estimate given above, and sugg" 
that little can be gained by seeking stiU more^ejabo 
schemes. (In [6] the capacity of the telephone'ch^nel 
estimated as of the order of 23 500 bits/s^rdugfiy 
sistent with what we are saying here.) ^ 

III. Uncoded Modulation Systems 

Digital QAM signaling schemes are conventional^ 
usefully represented by two-dimensional constehaUpni 
all possible signal points. A 2 w -point constellation 
used to send n bits/symbol. A fair amountofjeffo 
gone into finding "optimum" cons teUa dons We 
shortly see that the payoff for this,, effort* on. 
Gaussian-noise channels is relatively slight,, althoi^ 
schemes found are helpful precursors for more^ela 
schemes. . . • . ■ OVr" 



A. Rectangular Constellations } : , \ : ^ ; ( b 

- - ' ■ ■• ■ .* '••»>":, \m 
A brief flurry of theoretical papers in the early _ 
[7]-[10] developed two-dimensional, signd\constegai 
from various viewpoints. The., most ^interestmg^ 
present purposes are the family of constellations dexel 
by Campopiano and Glazer [9], . reproduced in t Fi^ 
have taken the liberty of substituting a "cross'coj^uM ' 
for theirs at n = 7; the two are. equally good.)^ 
integer numbers of bits/symbol, ^ . c^stell^ffi 
simply representations of two'in^ependent 'PAM' 
so the constellations are square, ^d^have^ points 
from the rectangular lattice , of, ppintej -^th|jod^ 
coordinates. It takes about ;.6 ,dBvmqre^pow,^ 
more bits/symbol, as expected. If orrodd^integ^^ 
of bits/symbol, the constellations lie" wt^Jaa^JLdg 
the form of a cross (and have,. 
"cross constellations") and'the ^po^^w^ra^^o 
same rectangular' lattice (except ^ 
tion, where the outer points/, are ApiU^pn^A^ 
symmetry and energy, say^gs).|^iAjAe|fi^ 
that the minimum distance , between an^t^g^g 
to 2, the average signal energy^ .in;,absoli^temg^ 
is as given in Table 11* Wft ;.see' that ^ th^jor^^^ 
tions" require about 3 dB more'.or;less*th^ 
or higher square consteUation, >resp^ 
expect. ' 'o;^(:f\^^$^ 

The Campopiano-Glazer construction^ c^bl 
eralized as follows: from an infini^,fu^a^ w p^GWg 
packed in a regular array or lattice, selectra clpsd^i 
subset of 2" points as a signal cpnsteUation.vl^iSka{^ 
principle is at the root of much recent wojfc^i^ 
explore appUcations of this principle, working ^Riffi® o 
simpler to the more sophisticated.;; *.> y\p^ 

When constellations are drawn from :a, ; regular 
within some enclosing boundary, the following ^asying 
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2a 
a 

a | 2a | 3a 


h 





/a = 2(^)«for Campopiano/Glazer constellations, n odd/. 

Fig. 4. Cross constellation boundary. 
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n = 4 cross (S = 10) 



• • .* 



• • • • • 
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Rectaneularsignal constellations (after Campopiano and Glazer 

n -■ - ■. ■.m. .... ■• 

■ill' ' >' >--. f . ■■■ ' 

■ f .V.- w TABLE II : 

Campopiano-Glazer Constellations 



n = 6 cross (S = 41) 



• • • • 

• • • • 



• No. Poirr 


ts 1 S" 


(dB) 




IdBl 


„ « 4 . 


2 , 


3.0 


. 27 


4.3 




V' \ 5.5 


7.4 1 






1 * v 


10 


10.0 


10.7 


10.3* 




U- 20 ., 


13.0 


20.7 


13.2 


64 


. 42 


16.2 


42.7 


16.3 


-128 


*' "i 82 t 


19.1 


82.7 


. 19.2 > 


256 


■ 170 


22.3 


170.7 


22.3 



^matioh.is useful -an^ for con- 

b^nsj)f even, moderate size (16 points and up). Let A 
d&ea,bf t the region consisting of points that are closer 
given lattice point , than to any other (i.e., the decision 
"belongihg t to that point, or the Voronoi (or Dirich- 
re^ohjll]). If we take some boundary that encloses a. 
in space of area B, then the number of lattice points 
^region will ^ be approximately 5/^, and the average 
ofjtfiese lattice points will be approximately equal 
aVerage^riergy (x 2 + y 2 ) of all points within the 

oaSt£uriple,,the Campopiano-Glazer constellations are 
troftiv a, rectangular lattice, with U = 4, and are 
ted by a s squafe of side 2 X 2 n/2 and area B s = 4 X 2" 
^even/ arid. by ! a cross of area. B e = 4x2" for n odd, 
'dimensions' as shown in Fig. 4. The energy calculated 
e integral approximation is : 

H "V* '5= (2/3)2 n (square) 
= (31/48)2" (cross). 



n = 6 circular (S = 40.9) 

Fig. 5. Improved rectangular constellations. 

Comparisons between S and S are given in Table II; the 
approximations are good ones. Furthermore, note that the 
cross is slightly more efficient than the square, by a factor 
of 31/32 or 0.14 dB (because it is more like a circle); this 
suggests that the cross would be the better shape even for n 
even, and indeed this is the case and can be easily achieved 
by taking alternate points from the next higher cross 
constellation, as shown in Fig. 5 for n = 4 and n - 6. The 
n = 4 cross constellation is as good as the conventional 
4x4 square constellation, and the n = 6 cross constellation 
is 0.1 dB better than the 8x8 square constellation, about 
as predicted. 

Of course, the best enclosing boundary would be a circle, 
the geometrical figure of least average energy for a given 
area. A circle with radius R has area itR 2 and average > 
energy equal to R 2 /2\ setting R 2 = 4x2", we find that the 
average energy for a 2 w -pbint circular constellation ought 
to be about 

S=(2/7r)2" (circle) 

which would be only about tt/3 or 0.20 dB better than the 
square, or 0.06 dB better than the cross. Fig. 5 also shows a 
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n = 2 
S = 2(3.0dB) 



n = 3 
S = 4.3125(6.3dB) 



n=4 
S = 8.75 (9.4dB) 



A* 

* 7 

■ V. 



• • • 



• • • 



9 • • • 

• • • 



• • • 



• • • 



• • • 



• • • 



• • • 



• • • • 



n=5 
S = 17.59 (12.5dB) 



• • • • 
• • • • 

• • • 



• • • • 

n = 6 
S = 35.25 (15.5dB) 



no3 
So 4.5 (6.5dB) 



Fig. 6. Optimum hexagonal constellations. 



1 ' 



■ * r ; > ; • /> 35 - 4 ^ 75 (15.5dB) 



more circular constellation for /i = 6, with the outer points 
moved to the axes as in Campopiano-Glazer's 8-point 
constellation; this constellation has been used in the Para- 
dyne 14.4 kbit/s modem and, like the n = 6 cross, is about 
0.1 dB better than the n = 6 square. 

B. Hexagonal Constellations 

1 As the densest lattice in two dimensions is the hexagonal 
lattice (try penny packing), constellations using points from 
a hexagonal lattice ought to be the most efficient. Indeed, 
the area of the hexagonal Voronoi region for a hexagonal 
lattice with minimum distance 2 is 2\/J = 3.464, or 0.866 
the size of the square region, which according to our 
approximation principle should translate to a 0.6 dB gain 
for a hexagonal constellation over a rectangular one with 
the same boundary. (A hexagonal boundary, as suggested 
in [12], has an energy efficiency within 0.03 dB of the 
circular boundary, or 0.03 dB better than the cross.) 

Fig. 6 shows the best hexagonal packings for n = 2 
through 6. For n > 4, the predicted 0.6 dB gain is effec- 
tively obtained over the best rectangular packings. (Histori- 
cal notes: suggestions that the hexagonal lattice would 
asymptotically be the best were made very early; see, e.g., 
[13]. The suboptimal n = 3 "double diamond" structure 



was actually used in a 4800 bit/s Hycom .moden^ 
mid-1970 , s. There, was a great deal; of attentionjto 
structures in the early 1970's bwause^of^themim 
in 9600 bit/s. modems;*' the v rather ^strangeJoq 4 ' 
shown here was apparently first discoyer^by^oscUni 
al. [14], and is still the best ; 16-po^ 
The n = 6 suboptimal structure s isiused!in^thexCcde^ 
SP14.4 modem.: 



IV. 



Elaborations of Uncoded/Modulabon 

In this section we shall discuss ;fui^erkvariantslp 
coded modulation: constellations. withinonunifonmp 
bilities, higher-dimensional-, uncoded^cpnsteUatiDns, 
constellations for .nonintegral numbers of bits/symbol! 



,4. Nonuniform Probabilities r . :Vi (v v^!p<^^f--^' 

'■<•■:■■ ''"<^&t^w: 

Attainment of the channel capacity bound; requires 
the signal points have a Gaussian probabihtydistibuti 
whereas with all the constellations of rthe previous' sectiol 
is imphcit that points are to be used with equal ^rpb 
hies. A uniform circular distribution of ri 
average energy S c = R 2 /2 and entropy H c = log 2 ^M 
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001 I 11000 
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in 



pr ,:b*t<r- ■ • 11 ;4inio # ' ' # : 

jfc 'a* ' » ; Fig:" 7. / Source-coded constellation. 

^dimensional Gaussian distnbution of vanance a z 
TM^n^ 2a 2 and entropy 

J§§^ ' ' ' •■ 

^pel^li^ same erltr^py^ the : Gaussian distribution re- 
jfe^factM dB)' less * average 

raer^'thaii thef OTCular distributi6h; ? : "* 
TfeonM ofp"7«)risteUati6h with nonuniform 
^aoiliM p^^tsTnumber of practical problems. One 
?sibie way;df f athievmg somW of the, potential gain is to 
^Vhe^inrarmng of nonuniform 

Sg&^ac&r^ to map the 

l^fix code words; infe : si^ points dra\^ as before from 
S|gi^ probability assocP 

fe"fatifa ''p^^e woH'-ofldiigfli / bits is then 2"'. 
jjjlffi^p^ of prefix code words and a 

lapping onto the ^hexagonal lattice that yields an average 
ft^'of^S- 7:02' .while tri^mtting an average of 4 
^s^bol*^ 1 dB over the 

^ = 4"unif6rin code known. Of course, the fact that 
Spmb^of^dati bits transmitted per unit time is a 
fflam ^vmaol^ leads to system problems (e.g., buffering, 
"ayjr 'that ''i'i^*^twi^-'BSky possible improvement in 
aal-to-noise margm. * 

Wigher- Dimensional 

Ins possible to achieve the same gain in another way by 
Sing blocks' ot data into higher-dimensional constel- 
SSohs without going to the true block coding to be de- 
l^^later '.sections. (By "true coding," we refer to 
BSnes in ; which ' the distant sequences in a 

^Sef niimrjer of dimensions is greiater than that between 
feints m two^dimensions.) We have already seen in Section 
*^tKat-a small (0.2 dB) gain 'is possible by going from 
SS^i^eSsional ?■ PAM . ' to two-dimensional QAM and 
Boosing :poin't? ; bii { 'a^ two-dimensional rectangular lattice 
^^thin^a'fcifcular father than 5 a' square boundary. In 
|same way; by going to' a higher number W of dimen- 
So^and choosing points on an JV-dimensibnal rectangu- 
"^lattice from within an W-sphere rather than an iV-cube, 
Ser^modest savings are possible. "Table III gives the 
8er^!*savingi5 possible 'in JV* dimensions," based on the 
iffererice between average energy of an AT-sphere versus 
ffljFjV-cube of the same volume. Note that as AT goes to 
infinity, the gain goes to ™?/6 (by the Stirling approxima- 
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TABLE III 

Energy Savings from N -Sphere Mapping 



N 


Gain 


dB 


2 


105 


.20 


4 


1.11 


.45 


8 


1.18 


.73 


16 


1.25 


.98 


24 


1.29 


1.10 * 


32 


1.31 


1.17 


48 


1.34 


1.26 


64 


. 1.35 


1.31 



Gain . -^£3- l(WZ)\)-V" (tor N even) 
12 



tion, (n\)~ 1/n goes to e/n), or 1.53 dB; the improvement 
over N=2 goes to e/2 or 1.33 dB, as computed above. 
This is because for large TV, the probabilities of points in ; 
any two dimensions become nonuniform and ultimately 
Gaussian. (It seems remarkable that a purely geometric 
fact like the asymptotic ratio of the second moment of an 
AT-sphere to that of an JV-cube can be derived from an ', 
information- theoretic entropy calculation in 2-space, but so 
it can.) 

Implementation of such a scheme also involves added 
complexity that may outweigh the performance gain. To 
send n bits/symbol in N dimensions (assuming N even), 
incoming bits must be grouped in blocks of Nn/2. Some 
sort of mapping must then be made into the 2* n/2 N- 
dimensional vectors with odd-integer coordinates (assum- 
ing a rectangular lattice) which have least energy among all 
such vectors. This can rapidly become a huge task; and a 
corresponding inverse mapping must be made at the re- , 
ceiver. Compromises can be made to simplify the mapping, 
at the cost of some suboptimality in energy efficiency; e.g., 
the cross is an effective compromise between the square, 
and the circle in two dimensions. 

C. Nonintegral Number of Bits /Symbol. 

It is sometimes desirable (as we shall see in Section VI) 
to transmit a nonintegral number of bits/symbol. Since in 
general an additional 1 bit/symbol costs about an ad- 
ditional 3 dB, it ought to be possible to send an additional 
1/2 bit/symbol for about 1.5 dB. In this section we give a 
simple method that effectively achieves such performance. 
The method can be generalized to other simple binary 
fractions at the expected costs, but we shall omit the 
generalization here. 

To send n+l/2 bits/symbol, we proceed as follows. 
Use a signal constellation comprising 2" "inner points" 
drawn from a regular grid, such as any of those of Section 
III, and an additional 2"" 1 "outer points" drawn from the 
same grid and of as little average energy as possible, 
subject to whatever symmetry constraints may be imposed. 
Incoming bits are then grouped into blocks of 2n + 1 bits 
and sent in two successive symbol intervals as follows. One 
bit in the block determines whether any outer point is to be 
used. If not, the remaining 2n bits are used, n at a time, to 
select two inner points. If so, then one additional bit selects 
which of the two signals is to be an outer point, n - 1 bits 
select which outer point, and the remaining n bits select' 
which inner point for the other signal. (That is, at most one 
outer point is sent.) With random data, the average 
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Si- 10. S 0 = 26, S-14 (11.5dB) 



n-5 

S, - 20. So- 54. S- 28.5 <14.5dB) 



n-6 

S, - 42. So - 1 02. 5 - 57 (1 7.6dB) 



S, - 82. S« - 21 6. S - 1 1 6 (20.6dB) 

Fig. 8. Constellations to send (n + 1) bits/symbol. 

energy is 3/4 the average energy of inner points plus 1/4 
the average energy of outer points. Fig. 8 shows constella- 
tions of 24, 48, 96, and 192 points that can be used in such 
schemes for 4 < n < 7; the average energy in all cases for 
w+1/2 bits/symbol is approximately halfway between 
that needed for n and that for n+1 bits/ symbol. Thus, 
these constellations are intermediate between the Campo- 
piano/Glazer constellations in the same way that the cross 
constellations are intermediate between the squares. (In 
fact, it can be shown that the 2-dimensional cross constel- • 
lations can be derived from 1 -dimensional PAM constella- 
tions with "inner" and "outer" points in an analogous 
way.) 

V. Coding Fundamentals 

Heretofore we have been concerned with methods of 
mapping input bits to signal point constellations in two or 
more dimensions, where in higher dimensions the bits 
simply lie on the lattice that is the Cartesian product of 
two-dimensional rectangular lattices, so that the distance 
between points in iV-space is no different from that in two 
dimensions. Now we shall begin to discuss methods of 
coding of sequences of signal points, where for the pur- 
poses of this paper we mean by coding (or "channel 
coding") the introduction of interdependencies between 
sequences of signal points such that not, all sequences are 
possible; as a consequence, perhaps surprisingly, the 
minimum distance d^ in iV-space between two possible 
sequences is greater than the minimum distance d Q in 
2-space between two signal points in the constellation from 
which signal points are drawn. Use of maximum likelihood 
sequence detection at the receiver yields a "coding gain" of 
a factor of d^/dl in energy efficiency, less whatever 
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additional energy is needed for signaling. (In practice^ 
some of the "coding gain" may be lost due to there being^l 
large number of sequences at distance d^ from:th^ 
" correct sequence and therefore a large number of. possibiJi| 

ties for error, called the "error coefficient" .effect. 'We^'sJ 
not be able to discuss the "error coefficient" much in!:; 
paper, but offer some general remarks at the end of Sectio3| 
VIII.) " -v ;?; >v." ' ;W||r 

Conventional coding techniques cannot be, directly j 
plied in conjunction with band-limited modulation { 
niques, at least with significant gain. (In 1970-19J1^ 
least four companies prototyped conventional ^cod' 
schemes for use in high-speed modems; two of the compa| 
nies failed, and two shortly withdrew their products f 
the market.) In recent years, however, a number; of e__. 
tive coding techniques have been developed for such 2 
cations. The most important point to be made in this ^„ f 
is that all of these coding schemes can be. developed . .f 
common conceptual principle. -This principle t ' t was.set^fi 
clearly by Ungerboeck [15], who called it " mapping^ 
partitioning," although its roots may, perhaps' v .be^oi 
elsewhere as well. We describe, it in .this section, and 
succeeding sections then use it . to develop jail .loiown^ 
some new coding schemes, both block and. trellis^|^ 

We shall consider only 2-dimensional conste^^ 
points drawn from a 2-dimensional ^ rect^gular grid.^ 



research to date, we cannot find any advantage. to^ftartj 
with 
be used 

Such a constellation can be divided into two, subsets! 
assiRning alternate points to each subset; i.e., according! 
the pattern 



hexagonal grids when higher orders of ^coding arej 
iea.) ..-:;». .... 1. ■.&-.,'.(;< ■ j*s#v*«$8il 



B A BV V - A -"7 : -B [:f^k\^:^'C.f^ 
A B a' B P^SM$$M 

. . . / B ^'A ■ * B^^A'^B#^i|^^ 
a.... b j a , .B , t . . A . .. ii.^A l ilQ} s i'M 

; .. " in. 'Kr^y^^^^^^i 

The resulting two subsets ^^^and^A^^Y^t^/fj 11 
properties. •. • V • SlfSfSl 

a) The points in '.each : subset- Ue^^nl^^rectangul^ 
(rotated 45° with respect to the.prigmal^^ll^^^ 

b) The minimum squared distance between. pom^^ 
a subset is twice the minimum squared; distance 
tween points in the .original ,consteUa%n.;^^;^ 

Furthermore, because of the first., prpperty^the^par^^g 
ing can be repeated to yield 4,.; 8, 416;^; ^subsets^^ 
similar properties, and in particular witmnj-subsetsq^ 
distances of 4, 8, 16, • • • times Eig:^?^show||| 
64-point square constellation divided into* twp^subse^ 
32 points, 4 subsets of 16 points, 8 subsets of . 8. points^ 
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A 
B 
A 

V 



A, 6, A, 

B, At B, 

A. B. A, 

B, A, B, 

A. B, 

B, A, 

A. B. 

B, A, 



B, 
A, 
B. 



A, B, 

B. A, 



B. 

A, 
B. 
A, 
B, 



B, A, Bi A, 

A. B, A. B, 

B, A, B, A, 



B. 

A, 
B, 
A, 
B, 
A, 
B. 
A, 



Aon Bo« Am Buo Aooo Boeo Atto B«p 

Bin Abio Bcio A^i Bwi Am Boio Aioi 

An) Bin Aooi Boot A in Bm Aooi Ban 

Bdii Atoo Bm Aoii Boit A,« Bioo A«ii 

A«oo Bom Alio Bt« Am Bo» Ano Bno 

Btoi A«o Boio Aim Bw Aoio Boio Am 

Am Bm Aooi Boot Am B, M Aooi Boo, 

* Bon Atoo Bioo Aon Beii Atoo Biob Aon 



B A B A B 

A * B A B A 

B . As . B ' ; A ' B 

A B A B A 

B A B A \ B *• 

A B A B A ' 
B vJ-A OB . A,. B . 

.■..• } tv.c%/4.j«i:. 1 *;: 

.-wiBi^An Bn ' Ago Boo An B11 .. 
■M*ti > Boi ; A, 0 ' B«< Aii v Bti. Aio^j . 
I*nfBii* Aoo Bn An ' Bn An Boo 
■MAUI Bio;' Aoi; Boi J Aw'. B« 'A* r \ jj 
|A»IBii An B„ An ' Bn An B„ . 
|8»j A*, J Boi A« ; Bio Aoi I Boi ' Ato > 
lAnfBii An Bn An Bn A* Boo . 
|M Bio Aoi & Boi(f A«l* B« ! A*, £ , • 

IVFiglf 9;^. r 64-point constellation partitioned four times. ■ 

pbsets of^.^pointSoThe subset nomenclature is accord 
jto*thb following binary tree: : .■•..{'.;,: 

ikti-Xk ^Ao » 



Uncoded \ 
Data Bits / 



Binary 
i Encoder 




Subset 
Selector 


Coded \ 









"Channel Coding" 



Remaining 
Data Bits 



0 



Selector 
of Points 
rom Subsets 



Signal Points 



Fig. 10. General coding scheme. 



pling "source coding" from "channel coding" and is im- 
portant both conceptually and in implementation. 

We now show how this general scheme can be applied to 
both block and trellis codes, with performance approach- 
ing the R Q estimate. 




ORIGINAL CONSTELLATION 



/"X /""X f N 

tf-'-'fi ' A A A A A A A 




^menclature ; is different from that of Ungerboeck 
.JwfiiftuSes ;a more w natural subscript notation that 
Becfetlie "successive binary partitions; our nomenclature 
"]^usHul : in the ribxt section/ where we shall use the 
Spiderit by inspection) that at both the 4-subset and 
Smet^ievels; the ^minimum squared distance between 
"^siibsetSi r say;- isv ld\ times the Hamming dis- 
Jffietweeti • their subscripts: '! e.g., d 2 (A 0 > A x ) -» 2d\\ 
m*li)'= 2d 2 0 ; dHA^ Aii)^ ^; and so forth, 
fee subsets may then be used to implement relatively 
|8&but effective coding schemes, illustrated in general 
^felO. iCertaih incoming data bits are encoded in a 
^eiicoderi resulting in a larger number of coded bits, 
ded bits are then used to select which subsets are to 
Rfbr each symbol. The remaining incoming bits are 
^ed&but. merely select \ points from the selected 
S^with ^the .signal constellation chosen large enough 
^mmddate all incoming bits. The coding scheme is 
fervor less decoupled from the choice of cohstella- 
tM^ig'tisVit is of the rectkhgular grid type. The 
5 i^^is n effectively determined by the distance prop- 
^fftfil? subsets combined with those of the binary 
p^trdlessi of the size of the constellation. Oh the 
Jfitod/n'tKe constellation sizei boundary, symmetries, 
" uncoded" properties such as were investigated 
3ier sections are more or less independent of the 
iSlbits arid are determined by the mapping of the 
* ig bits. This may be regarded as effectively decou- 



VI. Block Codes 

In Section IV-B we saw what could be achieved by using 
points from an AT-dimensional rectangular lattice and using 
an AT-sjphere, rather than an JV r cube, as a boundary. 

The rectangular lattice comprising all iV-dimensional 
vectors with odd-integer coordinates is not the most densely 
packed for any N greater than 1; for example, for = 2, 
the hexagonal lattice is 0.6 dB denser, as we have seen. 
Finding the densest lattice in N dimensions is an old and 
well-studied problem in the mathematical literature. Table 
IV gives the densest packings currently known for all N up 
to 24 and selected larger N, with the improvement in 
packing density over the rectangular lattice given in abso- 
lute and in dB terms [16]. The dimensions N = 4, 8, 16, and 
24 are locally particularly good and are known to be 
optimum in the sense of being the densest possible lattice 
packing in these dimensions. 

. A body of recent work [11], [12], [18]-[21] generalizes the 
Campopiano-Glazer construction to N dimensions by tak- 
ing all points on the densest lattice in JV-space that lie 
within an JV-sphere, where the radius of the sphere is 
chosen just large enough to enclose 2 mN points, to send m 
bits per dimension. These codes obtain a "coding gain" 
over PAM v/hich is a combination of both the lattice 
packing density gain of Table IV ( <f channel coding") and 
the AT-sphere/JV-cube boundary gain of Table III ("source 
coding"). The resulting coding gains achievable for JV = 2, 
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TABLE IV 
Energy Savings from Dense Lattices 



TABLE V 
Combined Energy Savings 



Density 



Gain 



dB 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 



24 
32 
36 
40 



.500 
.269 
.177 
.125 
.086 
.072 
.063 
.063 
.044 
.037 
.035 
.037 
.035 
.036 
.044 
.063 
.063 
.072 
.088 
.125 
.177 
.289 
.500 
1.000 
1.000 
2.000 
16.000 
16832.947 
4194304.000 



1.00 

1.16 

1.26 

1.41 

1.52 

1.67 

1.81 

2.00 

2.00 

2.07 

2.18 

2.31 

2.39 

2.49 

2.64 

2.83 

2.89 

2.99 

3.10 

3.25 

3.39 

3.57 

3.77 

4.00 

4.00 

4.16 

4.59 

8.00 

6.44 



.00 
.62 
1.00 
1.51 
1.61 
2.21 
2.58 
3.01 
3.01 
3.16 
3.38 
3.63 
3.76 
3.96 
4.21 
4.52 
4.60 
4.75 
4.91 
5.11 
5.30 
5.53 
6.76 
6.02 
6.02 
6.19 
8.62 
7.78 
8.09 



N 


Gain 


dB 


2 


1.21 


.62 


4 


1.57 


1.96 


6 


2.36 


3.74 


18 


3.54 


5.50 


24 


5.16 


7.12 


32 


5.24 


7.19 


48 


8.04 


9.04 


64 


8.69 


9.40 



. 1 > 

■r. :«>. 



i 



4, 8, 16, 24, 32, 48 and 64 are shown in Table V. Because 
the number of near neighbors in these densely packed 
lattices becomes very large, the total number of error 
events ("error coefficient") becomes large, which reduces 
the coding gain realized in practice. Also, the mapping of 
all m N bit combinations to their corresponding signal 
points can be an enormous task, even if all possible sym- 
metries and simplifications are cleverly exploited [20], [21]. 

We will now show that certain of these dense N-dimen- 
sional lattices can be constructed using 2-dimensionai rect- 
angular lattices, the subset partitioning idea, and simple 
binary block codes. In particular, we shall give construc- 
tions for N= 4, 8, 16 and 24 that form a natural sequence 
both in complexity and in nominal coding gain (respec- 
tively 1.5, 3.0, 4.5, and 6.0 dB, using the simplest imple- 
mentations). (Cusack [21a] has recently shown how to 
construct dense 2"-<limensiohal lattices from 2-dimensiOnal 
lattices using Reed-Muller codes, for any n; for N = 4, 8, 
and 16, the lattices obtained are the same as those we 
obtain here.) 

To generate the optimum AT-dimensional lattices for 
N = 4, 8, 16, and 24, we shall use sequences of 2, 4, 8, and 
12 points from the 2-dimensional rectangular lattice, parti- 
tioned as shown in Fig. 9, into 2, 4, 8, and 16 subsets, 
respectively. 

The 4-dimensional lattice is generated by taking all 
sequences of two points in which both points come from 
the same subset, i.e., sequences of the form {A, A) or 

(B,B). 1 ■ . 

The 8-dimensional lattice consists of all sequences of 
four points in which all points are either A points or B 
points and further in which the 4 subset subscripts satisfy 
an overall parity check, i x + i 2 + i 3 + U = 0; e S-> sequences 
of the form (A 0 , A 0 , A 0 , A 0 ), (B 0) B v B 0 , BJ, and so forth. 
(In other words, the subscripts must be codewords in the 
(4,3) single-parity-check block code, whose minimum 
Hamming distance between codewords is 2.) 

The 16-dimensional lattice consists of all sequences of 
eight points in which all points are either A points or B 



points, and further, in which the 16 subset subscripts je 
subset now having two subscripts) are.codewordsfir 
(16,11) extended Hamming code, whose minimum 7 
ming distance between codewords is 4. - • ^-liwut- t^^ 
The 24-dimensional lattice consists of all. sequence^! 
points in which all points are either 04 points or; fifpomy 
the 24 (i, j) subscripts are codewords in the (24,12) :<Mg 
code, known to have ininimum Haniming distance v 8j3a 
further, in which the third. subscripts./:, are cqnstratoedjja 
satisfy an overall parity check in the f ollowing ^y*pH 
sequence is of all ,4 points; then overall k ypanty'te;wr 
(an even number are equal.to 1), while if . the sequence^ 
all B points, overall,* parity is odd:, V; : -A } §J 
If the minimum squared distance between points^ 
2-dimensional constellation, is X4l> '. ^\v%«pfclfflL_ 
squared distance between *Vcto&\teWfcl&i#£M 
higher-dimensional lattices^can >be. sho^.to be.2<i„;^y 
%d\, and \bd\, respectively" as, follow,?!, 

a) A sequence of A points and a 1 sequence of ,B-$om 
differ from each other tysqua?ed;distajK>e.at J^^flf 
every point and therefore by/M { ^til4l*"M»M*$JL 
\2dl in total. Iafact,'m\the.24^dimensi^^ase^<^| 
distance of at least ,5<tf tin' at^leasCpfie. i5yt^oU f W 
minimum squared) distance between '^sequences; anM 
sequences is at least 16rf£>'Thet»pippf.; iWltf 
properties of the •.Golay, i cpde.j aStaW^as.^jparij 
partitioning shown in Fig. 9 and.is, in^the Appendix.^ 

b) Two diffeient ..sequencesiiwith'" pojnis^aB^frp^^ 

same sequence of subsets must^^ift&O^f SSOT 
by the minimum. within-subseti ^^squared : idlstance, , »h^r 

2d\, Ad*, Ml; W'16dl\'ttsp9c^4ym^^^ y ^'^ 
establish 2d\ as : the nunimum ;squared/dis.^ 

sequences in the dimensional -c^a^^l^l^^L 

c) For N = 8, 16, and 24,^the^(-f,^KsubseJjsub|^ 

are drawn from <4;3Vf,A«. 11 M^(^^»g«^ 
minimum Hamming distances 2;\4^afld^.r^e0iyea 
the relation between subscript Hanmimg distance^^ 
subset squared distance,.'.^ ^Hk^US^m^^k 
two sequences with pomts.draw^>frpmisubsetec0^hM 
type (A or B) .but .'different;!/ '.otl(f7!t)^b^5$^. 
differ by squared distance ••atj:leasti4d^8J^Qr.j|^ 
respectively. This is MwXnee&J&Zitety^ffi^ 
sional cases. ' v 

d) For N = 24, two sequences of points : from;subse^ 
the same type and with, the same|(/i V)f.subscrip.ts^ 
different k subscripts must differ by atleast 8</„. in at;le" 
two symbols because of the overall k parity check, andJ 
fact that the minimum squared :distance:between pomtsj 
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[fee type and- with .theSame (i, J) subscripts is id*. 
tecondudes the 24-dimehsional proof. • 
feo send m bits/symbol using these lattices, we need to 
Ka^ofU^imio one of 2^ lattice point 

Ibshbuld be ••chosen- iho%ever, implementation of the 
B^rron^ 

^*Simp^ 
ft^^i^^oMtetruct- the lattices, and the constel- 

feth <the;meth6T6f :sending fialf-integr^number^f 
fembbl given^hV: sWon rV^(for AT = 4 and 16). The 
BP^^^ifeli^ small, ganging from a few 

fe#is i l Up«toda^;tte;^Kete/JV-cube gain 

llntable^ ': ' u- t tft 

fct«block^»{KSb'its; we always^se one bit to 

^wheiherU orrB . points will be used. For JV- 8,16, 

SB^^further' setrof bits is used as input to a binary 

mer^wbici'pr^u^-apprbpAate codewords to be 

Subset 'subscript^ighatprs:- 3 bits to produce 4 

afeg, 11 bits to produce 16,for ;N = 16, and 12+11 - 23 

feoduce 24 + 12 = 36 for :N = 24. Thus, a total of 1, 

j&eti; or :f,,l,li, or 2>its/symbol, respecUvety- 
Remaining bits are used ^ select points from the 
^Isubsets-vWeiuse 4 the rectangular, constellations ot 
^fand,8^ai';folIows:^or / .JV = 4, constellations of 
^points', as.in Fig^8,:divided.into two 1.5X2 - 
»sets,;^and^;;for ( ^ = 8, constel lations i of 2 
fcin-.Fig^ ;i 3; divided into four r-i-point subsets, 
»i6rcoWeUati6ns;of,1.5x2^pomt Sj asmFig 8, 
«£to,eight.l;5 x 2^-point subsets; ^d f or// -24 
■Hfta»-of i % +a . Points as in Fig, 3, divided into 16 
g^^t8>(fa\-aD : .cases.m must.be large enough 
fehe subsets: resulting from the partitioning have 
Me*, and. other desired properties, e.g., symmetries.) 
fe'be used, to send m -\, m -1, m-\\, or m-2 
fcbbUwhere for. -N = 4 and 16, the method of 
fehalf-ihtegral bits/symbol of Section IV-C may be 

J^!n^^S^^ian^e in TV-space is 2, 4, 
Regimes, the. minimum, squared distance for an 
■l^Soint constellation, there, is a distance gam of 
iPfofel&B; respectively.. However, the expanded 
ISatibns required with coded modulation cost 1.5, 3, 
»6\dB, respectively, yielding a net coding gain of 
ife^and 6 dB for TV = 4, 8, 16,and 24. The family 
fehipVof^s progression of codes is apparent 
KSiUkelihoOd sequence, detection of the lattice 
M&st^a sequence of ( received points is easy for 
PSnd,8»General methods are given in [22]. For TV - 8, 
^received points, assume first that ,4 points were 
fSid the closest A point to each received point, and 
subscript parity of the four, subsets tentatively de- 
«Lthe (P arity check fails, change the least reliable 
Ki to the next closest A point (which must be in the 
^ subset). This gives the best A sequence satisfying 
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the parity-check constraint. Repeat, assuming that B points 
were sent, to get the best B sequence. Compare the bestj 
and B sequences, and choose the better as the final deci- 

81 Detection for N-16 or 24 is harder, mvolving either 
generalization of soft-decision decoding of the (16,11) or 
f24 12) block codes to perform error correction on tenta- 
tive decision subscripts (as above; the method of changing 
the least reliable decision if an overall parity check fails 
was used in the 1950's as a soft-decision error-correction 
method for single-parity-check codes by Wagner), or ex- 
haustive search of a neighborhood of the received sequence 
in JV-space. Note that as its first step the decoder can 
always choose the closest point in each subset Ureach 
received point as representative of that subset, there bemg 
no reason to prefer any- more distant point and then 
proceed to determine the best sequence of subsets using 
Lse points (with their distances from the > received port) 
as proxies for the corresponding subsets; the decoding task 
may thus be partitioned in the same way as coding is 
partitioned in Fig. 10. 



VII. Trellis Codes 

On power-limited channels (such as the satellite channel), 
convolutional coding techniques have more or less become 
the standard (although there are some who continue to 
champion block codes [23]). Generally, anything that .can 
be achieved with a block code can be achieved with some- 
what greater simplicity with a convolutional code. We have 
j"st seen that relatively simple (TV- 8) block codes can 
achieve of the order of 3 dB coding gam on band-hmited 
channels, and relatively complex (TV = 24) block codes can 
adueve of the order of 6 dB. We shall now see that trelhs 
codes can do the same, perhaps a bit more simply. 



' 1 !'f 



A. Ungerboeck Codes 

For band-limited channels, the trellis codes came first, in 
the work of Ungerboeck [15]. In Ungerboeck s paper, to 
send n bits/symbol with two-dimensional modu ation a 
constellation of 2« +1 points is used, partitioned into 4 or 8 
subsets. 1 or 2 incoming bits/symbol enter a ra e-* or rate- , 
binary convolutional encoder, and the resulting 2 or 3 
coded bits/symbol specify which subsets to be usel The 
remaining incoming bits specify which point from the 
selected subset is to be used. . , u - »> 

The coding gain obtainable increases with the number M 
of states in the convolutional encoder. Urtgerboecks sun- 
niest scheme uses a 4-state encoder and achieves a nominid 
fdB tdfog gain (a factor of 4, or 6 dB, in mcreased 
sequence distance, less 3 dB due to use of the larger 
2» +1 -point constellation). His most complex scheme uses a 
128-sfate encoder and gains 6 dB (the limit with 8 subsets 
and a 2" +1 -point constellation since the within-subset dis- 
tance is s/for a 9 dB gain, less the 3 dB due to the larger 
constellation). Table VI gives the coding gains obtained by 
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TABLE VI 
Coding Gains for Ungerboeck Codes 



State* 


Gain 


dB 


4 


2.0 


3.0 


8 


2.S 


4.0 


16 


3.0 


4.8 


32 


3.0 


4.8 


64 


3.5 


5.4 


128 


4.0 


6.0 




8-State Trellis 

, Fig. 11. Ungerboeck 4- and 8-state trellis codes. 



Ungerboeck for these and intermediate numbers of states. 

Decoding is assumed to be by the Viterbi algorithm [24], 
a maximum likelihood sequence estimation procedure for 
any trellis code. The complexity of such a decoder is, 
roughly proportional to the number of encoder states. With 
these codes, each branch in the trellis corresponds to a 
subset rather than to an individual signal point; but if the 
first step in decoding is to determine the best signal point 
within each subset (the one closest to the received point), 
then that point and its metric (squared distance from the 
received point) caii be used thereafter for that branch, and 
Viterbi decoding can proceed in a conventional manner. 
Fig: 11 gives trellises with branches labeled by subset for 
Ungerbpeck's 4-state and 8-state codes. 

(Note: the block codes of the previous section can be 
represented as treilis^s; : Fig. 12 shows the trellises corre- 
sponding to the N = 4 and N = 8 codes. Viterbi decoding 
could therefore be used for them as well. It is interesting 
that the block code with 3 dB coding gain is also associated 
with a Estate trellis, albeit decomposable into two parallel 




N»4 







At / 






'* 2 


' A, 




A. ; \? 

« V • ■ v ■ 


s B. 




' B. \ 








By/\. 


B. 






N«8 



Fig. 12. Trellises corresponding to N - 4 and N « 8 blockc 

2-state trellises. The iv" = 16 • block code c^sinma w 
associated with a 64-state (trellis 'decomposabletintQi 
parallel 32-state trellises, and the yv« 24 block;c<^g 
associated with a 2X^?6-kate^ 

8-state trellis cod^'with nominal 4 dB cb^g g^S 
the process of being 'adopted ?as* intemaiioH^S 
standards for 9600 bit/s^transmis«on?pver^ 
(dial) telephone network [25] and potentially for|l4^ 
transmission over private lines as* well [26].^slig^ 
[27] of the Ungerboeck scheme" involv^^nral^p 
volutional encoder is being used' in these^stydiar^l 
this variant, whose trellis' '''isTshowtf !in^Fig^3 ^ | 
rotation of a coded sequence is'another" coded^se^uena 
that differential coding techniques may be^used^IM 
tance properties and therefore .^ro^ 
are apparently identical to those'p^ 
scheme.' ' ^ : ^:M0^M' 

B. Other Trellis Codes J y£ ^'-^^SSfSa 

• * • ■ • < r V^SSL 

The Ungerboeck codes seem to cover theTpngew 
ble coding gains with complexity of the - prder? ofl ME3 
might expect, and may' : thmfore;be^^^^|M^ 
of how much complexity is' neeidedrto? achiWeS ai ^ 
coding gains in the 3--6 dB r^^'i^^th^be^imw 
upon? From our research, the ^$wer;sten^Ho'bej^«^ 
riot very much. In this section* we/shM^descnMT 
schemes that exhibit modest improvements and somgfi 
ideas: a 2-state code that has a ' nominal) co^jing^gajM 
almost 3 dB, and an 8-state trellis code with'a/cpdii^^ 



et at.: modulation for band- limited channels 



643 



Aoo. Aoi, VAit,- Aio ><* 0 
Boo, Bio, v Bui B01 V 1 
AS,' Aoo", 'Aioi An 'H r r 2 



5 « v * 



Boi, Bn, Bioi Bio v, 3 
An, AiOi Aooi ,Aoi j. .^4 

B«; Bol'Boo. Bio?i ' 5 
AiAur^Abb^^- 

Bio, Boo, Boi, BH/ h 7 

Fig., 13.j -Nonlinear 8-state trellis code with 90° symmetry. 

4;5|dB;i t The.:iiew ideat in the 2-state scheme is to use 
sctslthat are partially overlapping and partially distinct, 
new idearin l,the-.8-state scheme is to use a 4-dimen- 
al- constellation: rather than 2-dimensional as the basic 
ftllatiori;:(4-dimensional trellis codes have also been 
ai5d by Wilson [28] and Fang-*/ al. [28a].) 

VtstateTrellti Code:; If a 2 n -point signal constellation 
partitioned into two subsets of A points and B points, 
Ufcliannel ^coding lis done using the (infinite) 2-state 
s 'shown at the top of Fig. 14, it would appear at first 
ce that a 3,dB v gain is obtained at no cost. This scheme 
bits/symbol with no signal constellation expan- 
furtheryany two sequences that start at one common 
and end at another differ by a squared distance of at 
idli .because, the paths. differ by at least d 2 Q when 
(ii verge and , another 5 . dl when they merge, so the 
ffinal coding gain is apparently a factor of 2 or 3 dB. 
course, we cannot get something for nothing, and the 
ipdn. this scheme , (called "catastrophic error propa- 
<m"tiri the convolutibnal coding literature) is that there 
paths of infinite length starting from a common node 
never* remerge< and. have squared distance only rf*, 
ely'any two paths of the form AXYZ • • • and BXYZ 
(The "error coefficient" is infinite.) 
One way of curing this problem is to terminate the trellis 
ry^ symbols by forcing it to a single node, illustrated in 
'4(b). In other words, at the bth symbol, the subset is 
itfained ltd be A or B 9 as necessary to reach the 
'griated node. Only,w -1 bits can be used to determine 
#h symbol, so there is a cost of 1 bit per b symbols of 
mission' capacity, but now a legitimate coding gain of 
iffijis obtained minus (l/b)x3 dB for the rate loss. The 
ace block code would operate in just this way if it used 
^ points, partitioned into A 0 and A x (see the top half 
trellis in Fig. 12); happily it is possible to insert a 
ar code made up of B points into the interstices of the 
e lattice without compromising distance, and the 
tional bit involved in specifying A or B compensates 








Fig. 14. 2-state trellises, (a) Infinite nonredundant 2-state trellis, 
(b) 2-state trellis terminated every 4 symbols, (c) Time-invariant 2-state 
trellis with A' * A'\ B' * B" . 



B" A'^B A 

A' bJa B 

A'l B A B A 

B* | A B A B 

A' J B . A B A~ 

B' ,_A _B_ A B 

B* A" | B A 

A' B' t_A_ B_ 

B" A - 



A" B" 

B A~l B' A' 

A B l_A^_ B* 

B A B A~l B' 

A B A B | A' 

B A B A I B' 



A B | A" 

I 

B A B' A' 



B 



Fig. 15. 80-point signal constellation. 32-point subsets A\ A", B\ B" 
have 24 common points^ uniquej>oints; e.g., A'~ 24 A inner points 
plus 8 A' outer points. S,= 31.3; S 0 = 80; S = 43.5 (16.4 dB). 



for the bit lost at the fourth symbol, and allows a full 3 dB 
gain. This terminated trellis code may be regarded as a 
generalization of a single-parity-check block code. 

Another way of gaining almost 3 dB while using a 
time-invariant trellis code is as follows. The signal constel- 
lation is modestly expanded to include (1 + p)2 n points, 
arranged on a rectangular grid and divided as usual into A 
and B subsets of (1 + p)2 n ~ l points each. The A and B 
subsets are further divided into (1 - p)2 n ~ l "inner" points 
and 2p "outer" points. Finally, sets A\ A'\ B\ and B'\ 
each of 2"" 1 points, are created as follows: A' and A" 
both include the (l-/?)2 n_1 inner A points, but each 
includes a different half of the 2p outer points; and 
similarly with B' and B". Thus A' and A" are partially 
overlapping and partially disjoint, and so are B' and B". 
The probability that a random choice from A' will not be a 
member of A" is p. Such a construction is illustrated for 
n — 6 and p — 0.25 in Fig. 15. 

Now the sets A\ A'\ B\ and B" may be used in a 
2-state trellis as illustrated in the last part of Fig. 14. The 
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squared distance between two paths beginning and ending 
at common nodes remains 2dl since the basic distance 
properties between A subsets and B subsets remain. But 
now, although there are still pairs of infinite sequences 
comprising only inner points that start from a common 
node and never accumulate distance of more than d], their 
probability is zero. The squared distance between any 
branch that uses an outer point and its counterpart branch 
must be at least 2d] since the counterpart branch cannot 
use the same outer point and the distance between differ- 
ent points in the same subset is at least Id*. For practical 
purposes, this means that a sequence containing such a 
branch cannot be confused with a sequence containing the 
counterpart branch, their squared distance, being at least 
3d% 9 so that in effect, whenever an outer point is sent, 
reconvergence to a common node is forced, as in the trellis 
termination method. Here, however, the reconvergence is 
probabilistic and happens on average every l/p symbols, 
e.g., every 4 symbols if p = 0.25. 

There is a slight reduction in power due to the increased 
constellation size; e.g., the average power using the constel- 
lation of Fig. 15 is 43.5 or 16.4 dB, versus 42 or 16.2 dB for 
the 8 X 8 constellation, or 41 (16.1 dB) for the Fig. 5 n = 6 
cross constellation. Use of an integral approximation gives 
an estimate of additional power required of a factor of 
1 + p 2 , or 1.0625 (0.26 dB) for p = 0.25. This can be made 
as small as desired by reducing p\ the cost, however, is a 
greater average time to converge and a greater average 
number of near-neighbor sequences ("error coefficient") 
increasing inversely proportional to p\ for this code the 
"error coefficient'* is rather large and must be taken into 
account. Any p greater than zero in principle avoids 
catastrophic error propagation; a p of about 0.25 seems a 
good choice in practice. 

2) 8 -State Trellis Code: For the 8-state four-dimensional 
scheme, we shall use a two-dimensional rectangular grid 
divided into four subsets as before. The binary convolu- 
tional encoder for this scheme, however, operates on pairs 
of symbols rather than single symbols. An appropriate 
encoder is shown in Fig. 16. During each pair of symbol 
intervals, three bits enter the encoder and four coded bits 
are produced. The first two coded bits select the subset for 
the first symbol and the second two bits select the subset 
for the second symbol. 

If the Hamming distance between two encoded se- 
quences is K y then the squared distance between the map- 
pings onto grid points is at least Kd%. We now show that 
the minimum free Hamming distance of this convolutional 
code is 4. First note that the response of the encoder to a 
' single 1 on any input line is a sequence with even weight, 
from which it follows that all encoded sequences have even 
weight and the minimum free distance is even. By inspec- 
tion, it is easy to verify that there is no encoded sequence 
of weight 2, and a simultaneous 1 on all inputs yields an 
encoded sequence of weight 4. Thus, the squared distance 
between any two sequences corresponding to different 
' encoded outputs is at least 4 d\. If the encoded outputs are 
the same, but different elements are chosen from the same 
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Bo : : ,o 
B,-' " 1 
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Fig. 16. Encoder for 8-state four-dimenao 



subsets, then again the squared disti^M 
Suppose now that the signal constellatior 
points. Then, over two symbol interva^ 1 '* 
the modem and one parity >h^i?^|S£ 
bits to select the two signal points. SmceaS 
enter the modem, there is a'loss'of 4:5jg|^ 
signal constellation and, a gain qf 6 dj^^st 
nominal coding gain of 4.5 dB. . V^K^^Kfc 
Section IV discussed encoding for^a half-rat 



of bits/symbol, and that method l^^j^^ 
alternative which is somewhat more^attKic^ 
integer number n of bits/symbol|en^^ 
three bits entering the a>nvoluUonal^^dc 
symbols and In -3 bits entering^^g 
coder as described in Secdon? IV ;|T^ 
integer number of bits/symbol and^also^^ 
possible 1.33 dB for nonuniform 'prqbabdUtie 

VIII. Concluding' Re marks! 

• ' +r\ji-tt*V & ^ 

It has not been possible in ^ ^ s >pai>^^^| 
of topics that are of importance infpracti^"^ 

The only channel disturbance consi'd^^ 
Gaussian noise. Other di s ^^^|M^][^ 
on telephone channels. Thereisfsorn^ ^^ M 
ence that the coded modulationvschemejr 
robust relative to uncoded 'schemesftr^ 
dieted by Gaussian noise calculatio ^a g^ 
portant disturbances,', such * as t inpnlm^^ 
phase jitter, perhaps due .to the.memw^^ 
modulation and sequence • esumation|oy^ 

bols. ' • ^''V '-q^^^^ 

Because of the symmetries of attract^| 
e.g., the 90° symmetry of most qf v oinp©ct^^ 
tions, there may be an ambiguity in!]^^^| 
In general, there are two ways toHm^^ 
with coded modulation. If the v 9°de>^^^ 
sequence basis, every 90°i rotadpnfojO^g 
another legitimate code sequence,?^^^^ 11 
by differential quadrantal coding^t^E^^ 
transmission transparent to 90°^rotatip^gB 
90° rotations do not give valid w code^Mijg^ 
then it will be possible eventually ftr^^^jj 
force receiver phase to a vahd:settingj|[J« 
nique is generally preferred.; Ol foejqjdS^ 
cussed, the block codes generaUy ar^jiiffg^ 
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iK-Wllis-'^es'w'gwerany-iiot, although they can 
^gfe;mbdified;to;be; e^., the modification of Fig. 11 

KnaUy, we have mostly used nominal coding gam as a 
Ifeof merit of coding schemes. In fact, error probabih- 
feor cbded' systems on Gaussian channels are typically 
Gkf the form P{E)= *exp(- E), where the exponent 
lis* governed by 'the iiominal coding gain and the error 
Efficient" K~, is. of the order of the number of coded 
ES&ehces at minimum distance from an average trans- 
"Stied sequence/ .In general, the error coefficient 

|5f increases with the complexity of coding; 
jofcan cost a- significant fraction of a dB for coding 
Knes with moderate (3-4 dB) gain, for error probabih- 
ten the 10: 5 -10" 6 range; 

m$m become very large for schemes with large (6 + dB) 

rasSiaiv^^^:^^^*.^" 2 f» or the most 

JfetisigerieraUy significantly larger for block codes than 

Bte^^ A ' :< ^^"* le,,i0,nilial C0ding gain ' 
^Pus.;the error, coefficient cannot be ignored in a more 

^wi assessment of 'coded systems. 
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b) For A points, i = 0 iff y = 0mod4, and ;' = 0 iff 
x = 0mod4; for B points, i = 0 iff y = lmod4, and ; = 0 
iff x=lmod4. " . 

c) For ^ points, x + y = 2/ + 2; +4fcmod8; for 5 
points, x + y = 2/+2;+4fc+2mod8. 

Two sequences with points from different groups differ 
by at least 1 in every one of the 24 (x, y) coordinates. They 
cannot all differ only by 1, however, because of the follow- 
ing. The sum S of all coordinates satisfies 



5= 2w ( ,-+4w ife modulo8, 
2Wij + 4w k modulo 8, 



for A sequences; 
for B sequences 



where w u is the number of (/, j) subscripts equal to 1, and 
w „ the number of k subscripts equal to 1. But since the 
subscripts form a Golay code word and all such 
words have weights equal to integer multiples of 4, and 
since w k is even for A sequences and odd for B sequences 
by construction, 



S = 0modulo8, 
4 modulo 8, 



for A sequences; 
for B sequences. 



'.<<<"■ 



VIII; Summary 



Ethe band-limited channel, dense packing of 2-dimen- 
ffial* constellations with optimal (circular) boundaries 
fes less than l 5r dB improvement over simple pulse amph- 
fifeoduiatibn! JJncoded schemes in higher dimensions 
"Tpernatively, source coding can gain somewhat more 
gi r dB b^u^ ! M^\^t8^*' nonunifonn proba- 
mfe'.These gains , pale by, comparison with what can be 
apnea with (channel) ;coding, where relatively simple 
trelUs codes'easily yield coding gains of the order 
ggaB, or 1 bit/symbol. Relatively complex block and 
fecodes have been constructed that yield of the order 
#dB, or 2 bits/symbol. Because this is as much gain as 
PS be predicted using the R B estimate and is only 3 dB 
How the capacity limit, it seems unlikely that further 
Ejor improvements' .are possible. However, within the 
"Strum of performance of already known schemes, there 
mf likely be some further embellishments that will reduce 
Splernentation complexity or have other desirable proper- 
^fsuch as the differentially coded variant of Un- 
Joorok's 8-state trellis code that is likely to become an 
Sernational standard. . 



., • ' " Appendix 
[Proof That 24-Space Lattice Has = 16</* 

Ke ? grid of triple-subscripted signal points illustrated 
BRg 9 is rotated 45° with a point A^ at the origin, and 
J^;so''that: the,,coordmates (x,y) of all points are 
ffiegers; then thefollowing hold true, 
fe For A points, both x and y are even; for B points, 
*th x and y are odd. 



Now suppose that there were a B sequence that differed 
from an A sequence by +1 in, every coordinate, and let m 
be the number of coordinates in which the difference was 
+ 1. Since the sum S A of the A coordinates is 0 mod 8 and 
the sum S B of the B coordinates is 4 mod 8, and S A -S B - 
m -(24- m) = 2m mod 8, it follows that 2m = 4mod8, or 
m = 2mod4. Now, the construction of the array is such 
that if a B point has an x coordinate 1 larger than the x 
coordinate of an A point, then the ; subscript is the same, 
whereas if it is 1 smaller, then the subscript is different; 
similarly a difference of +1 in y gives the same i sub- 
script, while a difference of -1 gives the opposite one ; 
Thus, m = w iJ . But, w o = 0mod4, so w = 0mod4; con- 
tradiction. Hence, any B sequence must differ from every 
A sequence by at least + 3 in one coordinate, Q.E.D. 

This lattice and its distance properties were originally 
discovered by Leech [29]. 



References 

[11 C. E. Shannon and W. Weaver, A Mathematical Theory of Com- 

P1 rSaS^S and^odula- 

I tion for probabilistic coding," IEEE Trans. Inform. Theory, vol. 

Ml i\ 12 M^sey!''C 2 odi'ng 9 ^d modulation in digital communications/' 
' in Proc 1974 Int. Zurich Seminar Digital Commun., Zunch, 

[51 ?1ftt?affa^Kffls, Principles of Communication 

,61 ftrt& T S^and^J. Weldon. Jr., Principles of Data 

II Communfcanons New York: McGraw-Hill, 1968, ch. 3 

[71 C R Cahn, "Combined digital phase and amplitude modulaUon 
11 conVmunica ion systems," IRE Trans. Commun. Syst., vol. CS-8, 



(81 
[9] 



4\ 

' t -f-B 



•pi 
km 

m 



? C 5 Han 5 cUc 96 L r! W. Lucky. "Performance of combined 
amplitude and phase-modulated commumcauon systems, IRE 

fude andT^e^ulation scheme," IRE Trans. Commun. Syst., 
vol. CS-10, pp. 90-95, 1962. 



M 



ill 



'Mm 



m 
m 

■ • : ■ , tjJ*;| 
'■: ■■%>] 

■.-ivf 



646 



IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. SAC-2, NO. 5, SEPTEMBER 1984 



*[U)] R. W. Lucky and J. C. Hancock, "On the optimum performance 
of W-ary systems having two degrees of freedom IRE Trans. 
Commun. Syst., vol CS-10, pp. 185-192, 1962. 

[11] J. H. Conway and N. J. A. Sloane, "Voronoi regions of lattices, 
second moments of polytopes, and quantization," IEEE Trans. 
Inform. Theory, vol. IT-28, pp. 211-226, 1982. 

[12] J. H. Conway and N. J. A. Sloane, "A fast encoding method for 
lattice codes and quantizers/' IEEE Trans. Inform. Theory, vol. 
IT-29, pp. 820-824, 1983. 

[13] R. W. Lucky, "Digital phase and amplitude modulated communi- 
cation systems," Ph.D. dissertation, Purdue Univ., Lafayette, IN, 
1961 p. 89 (attributed to Cahn). 

[14] G. J. Foschini, R. D. Gitlin, and S. B. Weinstein, " Optimization of 
two-dimensional signal constellations in the presence of Gaussian 
noise," IEEE Trans. Commun., vol. COM-22, pp. 28-38, 1974. 

[15] G. Ungerboeck, "Channel coding with multilevel/phase signals," 
IEEE Trans. Inform. Theory, vol. IT-28, pp. 55-67, 1982, 

[16] J. Leech and N. J. A. Sloane, "Sphere packings and error-cor- 
recting codes," Can. J. Math., vol. 23, pp. 718-745, 1971. 

[17] G. R. Welti and J. S. Lee, "Digital transmission with coherent 
four-dimensional modulation," IEEE Trans. Inform. Theory, vol. 
IT-20, pp. 497-502, 1974. 

[18] Canada, "An aspect of future modem development," CCITT Con- 
trib. COM-Sp.A No. 142, 1974. 

[19] N. J. A. Sloane, "Tables of sphere packings and spherical codes," 
IEEE Trans. Inform. Theorv, vol. IT-27, pp. 327-338, 1981. 

[20] P. deBuda, "Encoding and decoding algorithms for an optimal 
lattice-based code," in ICC Conv. Rec, pp. 65.3. 1-65.3.5, 1981. 

[21] Canada, "Block coding for improved modem performance," 

CCITT Contrib. COM-XVII No. 112, 1983. 
[21a] E. L. Cusack, "Error control codes for QAM signalling," Electron. 
Lett., vol. 20, pp. 62-63, 1984. 

[22] J. H. Conway and N. J. A. Sloane, "Fast quantizing and decoding 
algorithms for lattice quantizers and codes," IEEE Trans. Inform. 
Theory, vol. IT-28, pp. 227-232, 1982. 

[23] E. R. Berlekamp, ''The technology of error-correcting , codes," 
Proc. IEEE, vol. 68, pp. 564-593, 1980. 

[24] G. D. Forney, Jr., "The Viterbi algorithm," Proc. IEEE, vol. 61* 
pp. 268-278, 1973. 

[25] Rapporteur on 9600 bit/s duplex family modem, "Draft recom- 
mendation V.32 for a family of 2-wire, duplex modems operating 
at data signalling rates of up to 9600 bit/s for use on the general 
switched telephone network and on leased telephone-type circuits," 
CCITT Contrib. COM-XVII No. T30, Geneva, Switzerland, Mar. 
1984; also see J. D. Brownlie and E. L. Cusack, "Duplex transmis- 
sion at 4800 and 9600 bit/s on the general switched telephone 
network and the use of channel coding with a partitioned signal, 
constellation," in Proc. Zurich Int. Sem. Digital Commun., 1984. 

[26] Rapporteur on 14 400 bit/s modem, "Working draft recom- 
mendation V.CC for a 14 400 bit/s modem standardized for use 
on point-to-point 4-wire leased telephone-type circuits," CCITT 
Study Group XVII, 1984. 

[27] L. F. Wei, ' Rotationally invariant convolution^ channel coding 
with expanded signal space— Part II: Nonlinear codes " IEEE J. 
Select. Areas Commun., vol. SAC-2, pp. xxx-xxx, Sept. 1984. 

[28] S. G. Wilson and H. A. Sleeper, "Four-dimensional modulation 
and coding: an alternate to frequency reuse," Univ. Virginia, Rep. 
UVA/528200/EE83/107, Sept. 1983. 

[28a] R. Fang and W. Lee, "Four-dimensionally coded PSK systems for 
combatting effects of severe ISI and CCI ," in Proc. 1983 IEEE 
Globecom Conv. Rec, pp. 30.4. 1-30.4.7, 1983. 

[29] J. Leech, "Notes on sphere packings," Can. J. Math., vol. 19, pp. 
251-267,1967. 




G. David Forney, Jr. (S'58-M , 61-F'73) was born 
in New York, NY, on March 6, 1940. He re- 
ceived the B.S.E. degree in electrical engineering 
from Princeton University, Princeton, NJ, in 
1961, and the M.S. and Sc.D. degrees in electrical 
engineering from the Massachusetts Institute of 
Technology, Cambridge, MA, in 1963 and 1965, 
respectively. 

In 1965, he joined the Codex Corporation, and 
became a Vice President and Director in 1970. 
Since 1982 he has been Vice President and Direc- 



tor of Technology and Planning of the Motorola Information Systems 
Group, Mansfield, MA. During the 1971-1972 academic .year he wa£a 
Visiting Scientist at Stanford University, Stanford,* CA; and he has r ak§ 
served as an Adjunct Professor at the Massachusetts Institute of Technplr 
ogy. . t : v.y^V Wfl x ilfcfc?*Ati$ r ~ 

Dr. Forney was Editor of t the IEEE Transactions on Informatiq 
Theory from. 1970 to 1973 and has held numerous other IEEE portions? 
He was winner of the 1970 Information Theory' Group Prize Paper Aware 
and the 1972 Browder J. Thompson Memorial Prize Paper Award. He,wai 
elected to membership in the National Academy of Engineering in 1983. 

s '. • .0 ><t •.*'.'>. • !■ 

':•».«■ M(. ' /'.".t'iisK* 
.... * fjV 



> ...... .v. 

*■ " ' •> s, ^t% 

Robert G. Gallager:.(S'S8-Nr61-F:68). received' 
the S.B. degree in electrical engineering from the 
University of Pennsylvania,' Philadelphia,* PA?*in 
1953, and the S.M.and Sc.D. ; degrees in electrical 
engineering from the Massachusetts InsUtuteVof 
Technology, Cambridge, MA, in 1957 and I960, 
respectively. ' x - " 

From 1953 to 1954 he <was -a! Member of 
Technical Staff at BeU t Telephone Laboratories, 1 
and from 1954. to 1956 was in the Signal Corps Jf 
the US. Army. He has been with" the Massachu- 
setts Institute of Technology since. 1956 and was Associate Chairman of 
the faculty from 1973 to 1975.. He is currently * a Professor m|the 
Department of Electrical Engineering and Computer Science, Associate 
Director of the Laboratory for Information and Decision Systems^and 
Co-chairman of the Department Area I (ControlJ'Commumcation^ani 
Operations Research). He is a consultant to Codex i Corporation, .Western 
Union, and CNR. He is the author. of the textbooks Information Theory 
and Reliable Communication (New York: Wiley, 1968).' His major research 
interests are data communication networl^/ ; mfonnation l 'medry,^an<! 
communication engineering.- » -n* / > 4 ^C^ 
Dr. Gallager was awarded the IEEE Baker Prize Paper Award in 19j6£ 
for the paper "A simple derivation of the coding theorem, and some 
applications." He has been a member of me Administrative Committee of 
the IEEE Group on Information Theory, from 1965 to 1970 and from- 
1979 to the present, was Chairman of the, Group in ,1971, and was elected, 
a member of the National Academy of Engineering in 1979. 

" - ■• .... >iih ■ £rtii?i£f^ 




• '.-./= \:\ r'i wv&f *>\&* * V£S$*8S 

Gordon R. Lang was born on September .7,. 
in London, Ont.,..Canada. A He received , the ELScI 
degree (Hons.) in mathematics and physics, radio' 
physics option,' from the University' of Western 1 
Ontario, London, in 1949. J >--}u : ph 
He was with Ferranti Electric Ltd. from 1949 
to 1965. He was a founding member of ESE 
Limited (now - Motorola ' Information ^ Systems 
Ltd.) in January 1965 and became a. director of 
the company. He was, appointed Associate Pip; 
fessor with the Department* of Electrical 'Eih 
gineering, University of Toronto, Canada, in January: 1965. He increase^ 
activity at ESE Limited as Director of Research in 1969' and became'! an 
Adjunct Professor with the Department of Electrical Engineering, Univer- 
sity of Toronto. He is author or co-author of over 60 published papers 
plus a variety of conference papers. He has obtained approximately 16 
patents. Presently, he is Vice-President of. Research .and .Development at 
Motorola Information Systems Ltd. .*...*"' *Vi .' »■ ■ ' ■ ^ J 
Dr. Lang is a member of the Association of Professional Engineers of 
Ontario. 




! *^Fred M. Loiigstaff was born March 3, 1926, in, 
I; ^Thorhloe, Onti Canada, He received the M.A.Sc. 
I degree in engineering physics from the University 
of Toronto, Canada, in 1951. t 
.<.He was with Ferranti Electric Ltd. from 1949 
, • to 1965. He was a founding member of ESE 
\. Limited in January 1965 and became a Director 
of the company. He has had extensive experience 
" > since the early 1950's in the architecture of com- 
I - t puter systems and in computer software, leading 
&j v; to ( the position since 1965 of Vice-President of 
"terns and' Software Development with ESE Limited (now Motorola 
Information Systems Ltd.), Rexdale, Ont. 

^Dr. Longstaff is a member of the Association of Professional Engineers 
of'Ontario and the Association for Computing Machinery. 



Shahid V. Qureshi (S'68-M'73-SM'81) was born 
in Peshawar, Pakistan, on September 22, 1945. 
He received the B.Sc. degree from the University 
of Engineering and Technology, Lahore, Pakis- 
tan, in 1967, the M.Sc. degree from the Univer- 
sity of Alberta, Edmonton, Canada, in 1970 and 
the Ph.D. degree from the University of Toronto, 
Toronto, Canada, in 1973, all in electrical en- 
gineering. 

From 1967 to 1968 he was a Lecturer with the 
Department of Electrical Engineering, University 
of Engineering and Technology, Lahore, Pakistan. He held the Canadian 
Commonwealth Scholarship from 1968 to 1972. 

Since 1973 he has been with Codex Corporation, where he is currently 
Senior Director of Modulation Products Research. His interests include 
data communication, computer architecture for signal processing and the 
application of digital signal processing to communication. 




877 



^^ELECTED AREAS IN COMMUNICATIONS. VOL. 7, NO. 6, AUGUST 1989 

Wmerisional Constellations— Part I: Introduction, 



figures of Merit, and Generalized Gross 

Constellations 



G. 



DAVID FORNEY, JR., fellow, ieee, and LEE-FANG WEI 



tS^ltidimensibnal constellations are desirable for repre- 
[JoMirmimbers of bits per two dimensions, useful for in- 
J^fflciencyij and natural for use with multidimensional 
SutaTlw^Desirable characteristics of such a constellation in- 
lls^ efficiency^ low, implementation complexity, compati- 
!^ed M odulation and with QAM modems, including small 
BpeWto^vVrage power ratio (PAR) of its constituent 2D con- 
ftl^^ymmetryV scalability^ and capability of supporting 
^n^gsecondary channel.;; The gain in SNR efficiency of 
15oinai , ?onstellation (lattice code) consisting of the points 
^tnlh^region^ compared to a cubic constellation is 
SappnRimately separable into the coding gain of the lattice 
Eslja ^lain;; of the region R, for large constellations. Sim- 
^pansion^of ihe associated constituent 2D constellation is 
^e^pprbximately "separable into a coding component 
CER,(R). The N sphere is the re- 
(ffithWest;sKape gain; but also has large constellation ex pan- 
^ for^tHe^best possible shape gain versus CER,((Rl) or PAR 
SFinally, generalized cross constellations are discussed; these 
lEfens^yielii a modest snape gain with very low CER,(I^) or 
|Ssily implemented, are well suited for use with coded QAM 
and v cah be "readily adapted to support an opportunistic sec- 

i'OiIi.|V.» : .v..-i • •: 
5fe?V a:*' Introduction 

Wfidimensional constellation C is a finite set of N 
uples^or^pointSj in N dimensional space. The size 
*^ohsteliatiori is the number of its points. 
^raisVa^fairlyj long history* of constellation designs 
ffiarly^for^two-dimensional constellations with | C | 
jK%^lorjb-not too large, e.g., 2 < b <,8. Much 
Mu^towis iecoiinted in,[l] and will not be repeated 
$l i*V-.. . . 

cenPyearSj^there has been increased attention to 
^dimensional coristellations. Some of this attention 
lel^due! to' systems considerations, e.g., the need to 
^ctmnaf^numbers /3. of .bits per two dimensions. 
iqg^|[beenjdue.tb the recognition that higher SNR 
len^'is^theoreticaliy attainable in higher dimensions. 
Iw^f^tbejfriajor impetus, in our view, has been the 
^Kpment j! of { il multidimensional coded modulation 
ffi^th^tjnaturally lead to consideration of multidi- 
nsional constellations. 
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In this paper, we shall first discuss the major attributes 
desired in constellations, such as SNR efficiency, sim- 
plicity of mapping bits to points and vice versa, compat- 
ibility with coded modulation schemes, and compatibility 
with QAM modulation. The size and peak-to-average 
power ratio of the constituent 2D constellation are impor- 
tant considerations. We discuss one feature that has re- 
ceived more attention in practice than in the literature, 
namely, the capability, of supporting an 4 'opportunistic 
secondary channel," often used for internal control sig- 
nalling. 

A general method for constellation design is to choose 
a finite set of | C | points from an N dimensional lattice A 
(or a translate of A) that lie within a finite region R. Such 
a constellation has also been called a lattice code. This 
method not only results in a regular array of points in N 
space, but also yields constellations suitable for use with 
coded modulation. 

If C is a lattice code of reasonably large size, then the 
distribution of its points in N space is well approximated 
by a uniform continuous distribution over the region R. 
This is called the continuous approximation. We review 
a standard figure of merit for constellations that deter- 
mines SNR efficiency, and show that under the continu- 
ous approximation it is separable into two parts: one which 
depends on the density of A, which is measured by the 
coding gain y c (A) of the lattice A, and the other which 
depends on the sphericity of the region R, which is mea- 
sured by the shape gain y s (B) of the region R. 

Similarly, the same approximation shows that for lat- 
tice codes there are two more or less independent factors 
that influence the size of the constituent 2D constellation: 
the coding constellation expansion ratio CER C (A) of the 
lattice A, and the shaping constellation expansion ratio 
CERj(R) of the region R. The former is the price paid 
for the lattice gain y c ( A), and the latter is the price paid 
for the shape gain y s (R). The peak-to-average power ra- 
tio (PAR) in two dimensions is also primarily a function 
of the shaping constellation expansion ratio. 

The coding gain of a lattice (or, more generally, of a 
coset code <£) and the coding constellation expansion ratio 
are addressed in the coded modulation literature (e.g., [4] 
and the references therein). This paper is primarily con- 
cerned with the shape gain 7,(R), the shaping constella- 
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tion expansion ratio CER 5 (IR), and other aspects of shap- 
ing with a region R. 

. As discussed in [1], it is relatively easy to get coding 
gains of the order of 3 to 4 dB, using either dense lattices 
A (lattice codes) or more general coset codes <£ (e.g., trel- 
lis codes), and there are more complex lattices and codes 
that achieve coding gains up to 6 dB, or more. However, 
as shown in [4], when we consider the effective coding 
gain (the coding gain decreased by an * 'error coefficient" 
factor), and take decoding complexity into account, we 
find that after the initial 3-4 dB it takes of the order of a 
doubling of complexity to achieve each 0.4 dB further 
increase in effective coding gain, for the best codes 
known. Consequently, even though the shape gain 7, (R) 
is limited to 1.53 dB, as we shall see below, it may be- 
come more attractive to invest complexity in shaping 
rather than in more complex codes when we seek to 
achieve the best possible performance for a given amount 
of computational complexity. 

When -R is an N cube, there is neither shape gain nor 
constellation expansion due to shaping. When R is an N 
sphere, shape gain is optimized (for N dimensions), but 
CER 5 (R) and the PAR in two dimensions increase rap- 
idly with N. By limiting the peak energy of the constituent 
2D constellation, CER 5 (R) and PAR can be significantly 
reduced, without much reduction of shape gain. We de- 
velop bounds on the best possible tradeoff between the 
shape gain y s (B) and CER,(R) or PAR, which are in 
principle achievable as N -► 00. 

The final section covers generalized cross constella- 
tions, which generalize some of the constructions that 
were presented in an uncoded context in [1], and that were 
independently developed for use in coded modulation sys- 
tems by Wei [2]. We show that they have many desirable 
practical attributes; in particular, they are simple to im- 
plement in coded QAM modems, have very low constel- 
lation expansion, and do achieve a modest shape gain. We 
indicate how they may be adapted to support an oppor- 
tunistic secondary channel. 

A companion paper [Part II] discusses Voronoi con- 
stellations, which represent another approach to multidi- 
mensional constellation design based on a lattice-theo- 
retic choice of R. In general, Voronoi constellations also 
have most of these desirable attributes, and can achieve 
significantly higher shape gains, but at the cost of some- 
what larger CER 5 (R) or PAR. 

II. Desirable Constellation Characteristics 
In this section we discuss various considerations that 
influence constellation design, with emphasis on attri- 
butes that are desirable for constellations to be used in 
coded QAM modems. Throughout this section, we use 
simple square (two-dimensional) or N cube (N dimen- 
sional) constellations as a baseline. 

A. SNR Efficiency: Constellation Figure of Merit 

Naturally, the first characteristic that we require of an 
N dimensional constellation C is that it be able to repre- 



sent a desired number b of bits, in N dimensions, f |Y 
means that its size | C | must be (at least) if: (We'in 
itly assume that data words are binary b tuples, andjt 
all data words are equally likely.) It is convenient toj 
malize to two dimensions; a constellation that can j 
sent b bits per N dimensions can represent a nontialug, 
bit rate of 0 = 2b /N bits per two dimensions ( 0 '| " 
per baud," or, loosely speaking, 0 bits /Hz). • 

For a given size | C | or normalized bit rate /3, thejj 
parameters that determine the SNR efficiency, of frJc 
stellation C are the minimum squared distance -d 2 J n l 
between its points, and its average power P(C).eInjg 
paper the average power P(C) will be taken as tl%a 
age energy of C per two dimensions where theenergffi 
an Af-dimensional constellation point jc is itsVEuclife" 
norm ||jc|| 2 , and. the average energy of C is^ayt 
E[ \\x\\ 2 ] when all | C J points' are equiprqbable', soJ° 



= 2£[||jc| 



For a given normalized bit rate (3, we generally ;wjs|gj 
maximize d 2 mm (C) for a given P(C), or tp'^uj^D 
P(C) for a given d 2 mln (C). Therefore it is cu^tom^ 
take the ratio ;. ,. * .... i «. . •utVm^!"*^;. irfl 

cfm(c) 4.^cj^(<:KJJ-J 

' In ■ : l -.**i*i>*M"» }t\ tit ^ 

as what we shall call the, constellation figure^of;MB 
When C is used for signaling over a white GaussianSp 
channel, the required signal-to-noise - ratio *(SNR)JJs 
versely proportional to CFM ( C ). Since scdin^a^ 
in a constellation by^ 

d 2 m[n (C) and P(C) by, a- 2 " .QEM GO), is<a Jdimen§g 
quantity that is unaffected by scaling! We therefore 
evaluate the CFM(C) by evaluating the average 
P(C) of a constellation;. CV for Iwhich^^C)' has 
normalized to 1. ) >^; rtlt } ' : ^'^M „ 
For example, in one dimension,' a' ■ PAM'consteJlati 
is a set of M = 2 b equally spaced pointston thcr 
centered at the origin, el'g//the f Mhal^ 
range from — I 
E\\\xf\ = 

dimensions ^ - v - / - - c ;> 

of a PAM constellation is thus'.CFM'C 'C^j^tf/ (M? 
> = 6/(2 0 - 1) where 0 = Ib is the'noimaliz^bif 
per two dimensions. Thus asymptotically v CFM;(i 

We therefore define the ( ^i;/iWco«to/&tfi%^i 
merit for a normalized bit rate of '|3 WCFM© ( r /3) 1 =■ 
This represents the asymptotic 5 SNR' efficiency 
achievable with simple," conventional *PAM^c4ns 
tions, and will be our referencei^any^more'co^ 
constellation with the same 0 % had fetter achieve.? 
of merit CFM(C) that is greaterHhan 'CFMeOg 
gain obtained by a constellation r C*Wh ; 'normal : 
rate 0 over the baseline is defined as^ 

CFM(C)/CFM e (0) = 2*-?Jfficjy[«^B 

In N dimensions, a cubic constellation is the Ca 
product of a PAM constellation with itself N timel 
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> dimensions, a "cubic constellation" is a 

H-sfense (QAM ") ' M x M constellation. 

ffins^nal cubic constellation has size \ C \ - M 
Jnormalized bit rate 0 = 2 log 2 At, and 

Effc.) = {M 2 - i)/6, if rfL(C) = i; 

. „J-Af,'1hV figure of merit of a cubic con- 
|pM*CC)V==/6/(M 2 - 1) - 6/2* = 
^there is ho asymptotic constellation gain. 

^m^ation Complexity < » 1 1 • , • 

I&esira t)le Characteristic is that the mappings 
l^^^o^s^td^ Idimensional signal points and 
"jBBS fcbffi simple to implement. The com- 
p ^lemen tation is not much of an issue for two- 
nB^llalionsf Tsi'n . lookups with 2 b 

||e|f^ like b = 7, or some- 

ioweve^^en^eVwish to send, say, /3 > 7 
jffirnen^bns.using constellations in N = 4 or 
ffl^teff ties"wi^'2 6 = 2^ /2 entries tend 
J|^M^^lwe K must. specify algorithms 
Ji^^ji^ lireHtibns whose complexity is rea- 
ICTQpofepfrsidef Here the receiver's first task, 
Sa^tl^ points actually 

g^^i bisy received sequence ; this task is 
iSgSj^iB* is Considered in the coded mod- 

eflt^SitSomplexity of a cubic constellation 
^^u^specify'eacH coordinate (or pair of 
Se^lien^ an N dimen- 

^ ^^he^ilif ^fold Cartesian product of an 
jgjona l£consreilati6ri with itself, then the 
SBKSmpMity ^is;- that of the constituent 
^ffS^^oMtellatidh; but' then the constel- 
if^en^ire^nbrmalized bit rate, and all other 
eteirare aWno different from those of 
TOpf j^imensiolnal constellation. 

™iW§AM Modems I: Constituent 2D 

^res^d injniplementing codes and con- 
^N^ quadrature amplitude modulation) 
imodem^transmits a sequence of two- 
IBolsStfterefore .we will want to choose 
ffin^sionyN^a^ an even integer, and to 
pnalfsignaL point as a sequence of N/2 

JM^ rTsional constellation C, a constitu- 
^^^]!is the set of all values that a given 
mlJolitakes'on as the N dimensional 
thro ugh, C, i.e., the, projection of C 
flp lm^ sionsijf, the constituent 2D con- 
^^ pf ^all pairs of coordinates, we say 
^^mtt^diconstellat ion. Then we may 
^s titue nt^D constellation C 2 of C. (More 
onstj^ritl^b constellation C 2 may be 
ToStR^various constituent 2D constel- 

" ' mi 
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With this definition, C must be some subset of the points 
in the N dimensional signal set C^ 2 , the Cartesian prod- 
uct of C 2 with itself N/2 times, so | C | < | C^| = 
| C 2 |" /2 . It follows that the size \C 2 \ of the constituent 
2D constellation is lowerbounded by | C | 2/jV = 2 2h/N = 
2*, whether or not the normalized bit rate j3 is an integer. 

In general, it is desirable that the constituent 2D con- 
stellation be as small as possible, as is discussed in [2]. 
We define the constellation expansion ratio of a constel- 
lation C as CER(C) 4 |C 2 |/2* = \C 2 \/\C\ 2 ' N . We 
will want to keep CER(C) as close to its lower bound of 
1 as we can. 

For an N dimensional cubic constellation, the constit- 
uent 2D constellation is a square M X At constellation, so 
| C 2 1 = At 2 = 2 & , and the constellation expansion ratio is 
1. 
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D. Compatibility with Coded Modulation 

Calderbank and Sloane [5] introduced the terminology 
of constructing trellis codes based on partitions of lattices 
into cosets of sublattices. In [4], it is shown that essen- 
tially all of the codes with large normalized bit rates that 
have been used in coded modulation systems to date, in- 
cluding both lattice codes and lattice-type trellis codes, 
can be characterized as codes of this type, called coset 
codes. 

The key ingredients of a coset code ©(A/ A'; E ) are a 
lattice A, sublattice A' of A, and an encoder E. m 

An N dimensional lattice A is an infinite set of N tuples 
(lattice points), such that the sum or difference of any two 
lattice points is another lattice point. That is, algebra- 
ically, A is a group under ordinary N tuple addition. Ge- 
ometrically, a lattice is a regular array of points that cover 
N space (assuming that the lattice points span N space). 

A sublattice A' of A is a subset of the points of A that 
is itself a lattice. Since A' is a subgroup of A, it induces 
a partition of A into | A /A' | cosets (translates) of the sub- 
lattice A' where the integer | A/A' | is called the order of 
the lattice partition A/ A' (or the index of A' in A). More 
generally, any translate A + a of A is the union of | A/A' | 
cosets of A'. 

In most practical coset codes, A and A' are taken to be 
binary lattices. A binary lattice A is a lattice of integer N 
tuples (i.e., a sublattice of the integer lattice Z N ) that has 
2 n Z N (the lattice of all N tuples of integer multiples of 2") 
as a sublattice for some integer n, so that Z N /A/2 n Z N is 
a lattice partition chain. If A' is a binary lattice that is a 
sublattice of a binary lattice A, then Z N /A/A' /2 n Z N is 
a lattice partition chain for some integer n. From this it 
follows that the order | A/A' | of the partition A/A' is a 
power of two, say 2 * + r . 

The encoder E may then be a rate-£/(fc + r) binary 
encoder, which takes in k bits per N dimensions and gen- 
erates fc + r coded bits per N dimensions. The redundancy 
of the encoder E is r bits per N dimensions, and its nor- 
malized redundancy is p (E ) = 2r/N bits per two dimen- 
sions. 
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t Fig. 1 then illustrates the general structure of a coset 
code C based on a partition A/A' of binary lattices, when 
the code is used to send n information bits per N dimen- 
sions. Of the n bits, k are encoded in the rate-fc/(* + r) 
binary encoder £, which generates k + r coded bits that 
select one of the | A/A' | = 2 k + r cosets of A' whose union 
is some given translate A 4- a of A. The remaining 
n - k "uncoded bits" select one of 2" * signal points 
from the selected coset. The encoder E is normally ar- 
ranged so that the 2 k + r cosets of A' in A + a are equi- 
probable. 

Such a code therefore requires a constellation C con- 
sisting of | C | = 2 b = 2" + r signal points from a coset A 
+ a of the lattice A, divided evenly into 2* + r subsets^ 
each subset consisting of 2 n ~ k points from one of the 2 
cosets of A' whose union is A + a. For the purposes of 
constellation design, the effective bit rate is the coded bit 
rate of b = n + r bits per N dimensions, rather than the 
information bit rate of n bits per N dimensions. The size 
| C | is therefore a factor of 2 r larger than would be nec- 
essary to represent n information bits per N dimensions 
with a lattice code based on A. Per two dimensions, the 
constellation expansion ratio due to the encoder E is 
therefore 2 2 "" « 2'<*>. 

As an example, consider the four-state two-dimensional 
code of Ungerboeck [6], illustrated in Figs. 2 and 3 for 
the case n = 4. This code is based on the 4-way lattice 
partition Z 2 /2Z 2 where Z 2 is the two-dimensional lattice 
consisting of all pairs of integers, and 2Z 2 is the two- 
dimensional lattice consisting of all pairs of even inte- 
gers. The encoder £ is a rate- 1/2 binary convolutional 
encoder whose two output bits select one of four cosets 
of 2Z 2 . The constellation expansion due to E is therefore 
a factor of 2, in two dimensions. 

To send n = 4 bits per two dimensions with this code, 
we thus require a 32-point signal constellation (rather than 
16 points), whose points lie in a coset of Z 2 and divide 
evenly into four subsets lying in four distinct cosets of 
2Z 2 . Two such constellations are shown in Fig. 3. Fig. 
3(a) is a "cross constellation" whose points lie in the 
coset Z 2 + (1/2, 1/2) of Z 2 (the set of all pairs of half- 
integers), which is the union of the four cosets 2Z + 
(1/2, 1/2), 2Z 2 + (-1/2, 1/2), 2Z 2 + (-1/2, 
-l/2),and2Z 2 + (1/2, - 1/2) of 2Z 2 , labeled A, B, 
C, D. Fig. 3(b) is a "square" constellation whose points 
lie in the coset Z 2 + ( 1/2, 0) of Z 2 , which is the union 
of the four cosets 2Z 2 + (-1/2, 1), 2Z 2 + (1/2, 1), 
2Z 2 + ( 1 /2, 0), and 2Z 2 + ( - 1 /2, 0) of 2Z 2 , labeled 
A, B, C, D. 

In general, the Ungerboeck two-dimensional lattice- 
type codes are based on the two-dimensional chain 
Z 2 /RZ 2 /R 2 Z 2 /R 3 Z 2 / • * ■ of two-way lattice partitions 
R j Z 2 /R j+ l Z 2 where R is the two-dimensional "rotation 
operator" defined by the 2 x 2 matrix { [1, 1], [1, - 1] }. 
Geometrically (apart from an immaterial reflection), R is 
a rotation by 45° with a scaling by a factor of 2 1 . Note 
that R 2 Z 2 = 2Z 2 ; thus the code above can be considered 
to be based on the chain Z 2 /RZ 2 /R 2 Z 2 . 
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Fig. 1. General structure of a coset code.C based on a lattice partitigi' 
A/A', when used to send n information bits per N dimensions. f 



1 bit 


4-state. rate-1/2 
convolutional 
encoder E 


2 coded bits 
> 


coset selector 
Z 2 /2Z 2 * 













3 uncoded bits 



signal point 
selector- * 



one of four 
cosets of 2Z 

one of 32 



J signal points ' i 



Fie 2 Four-state two-dimensional Ungerboeck code (n 



A B A B 

C D C D C D 

B A B A B A 

C D C D C D 



B 



BAB 
CDC 



(a) cross 



.)>.") A B 

V.c d c d 

A B.A B A B Ijm 

CD CD C D CD 

'A B* A b'a B f 

,i C D C;D 

■. - (b) square' 



;itt!.'t 



Fig 3 Two 32-point signal constellations based* on the 4-way 'p 
■ zV2Z?: (a)cross; (b) square/; '■; y-n ( £^1il 

It is shown in [4] that, without loss of generality,^ 
code G(A/A'; £).that is based, on ,a .partitiqn^A//' 
dimensional binary lattices jmay.al way s r be^ 
code Cizy/ROZ^.E':), i.e., asj|spp^»;«hi^ 
cubic lattice Z?, and A'; is ./?fz£ Trim^&X 
called the deptfi of the partition^A/A^^is^the le^ 
such that A/A'/K*Z? is^^lattic^RaiJitiqn ^ 
representation involves* an ^augm^ed^j 
whose normalized redundancy fiiEil l§\ ^equabtg^ 
malized redundancy p(C)'of the.cpdef.C^as.de 
[4]. The order of the paritiM^ 
= 2*" /2 , so that E' mustfputJTOt^/^c^lyl 
dimensions, or /i coded ,bits^pei;|twpl.di 
can be arranged to select a'cose^of/?^? SinjZJ 
selecting N/2 cosets ^ of .J?K^Wffiai^te:oW 
dimensions at a time!' (In this representation, J 
ranged to select only those cosets of R?,Z in Z , ■ • 
are actually in A + a, 1 so the 2^^'cpsets of ^Zja 
equiprobable.) ^ ,; t - ? V' f v 

From this representation, we maj; draw' the Mt^ 
consequences for multidimensional! constellatio^ 
First, the constituent 2D constellation 'jC£ ^^ mustJbeJ 
from some translate of Z 2 :;Secpnd; the points ill 1 1 
divide evenly into 2 M subsets, /each corresponding^ 
of the 2 M cosets of R*Z 2 whose union is the'giyen r 
of Z 2 . For most good known rades^the.deptl). 
the range 2 < fi < 4, so that the partitioning of 
stituent 2D constellation is 4-way, 8-way, or 16-\ 
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or example, the 16-state four-dimensional (4D) code 
1 2] is based on the. 8-way, lattice partition Z 4 /RD 4 
re RD 4 is the. four-dimensional lattice consisting of all 
ger4-tuples that, are either all even or all odd, and a 
-2/3 encoder £ whose, three output bits select one of 
8 cosets of RD 4 whose, union is a translate of Z 4 , e.g., 
translate Z 4 ;+ (.P/2,-yl/2, 1/2, 1/2). To send n bits 
4 dimensions,: the ;nj - 2 that are not inputs to the 
oder select one 'of 2"jl* points in the selected coset of 
,, The four-dimensional constellation C therefore con- 
sof 2" + l points'.from a.coset of Z 4 . The normalized 
undancy ofi the, encoder £ is p (£.). = * / 2 bit P er two 
iensions^ and # the^corresponding constellation expan- 

a ratio isj2?0; i ^^^ljy^ o;is;' nt? J • '.' 

\n alternative, representation of this code is based on 

16-wayi lattice! partition % Z 4 /2Z 4 and a rate-3/4 aug- 
nted encoder^,' ; 5 The\nOrmalized redundancy, remains 
£') ^fl/2./ Here^^J.selectsUwb successive cosets of 

from a constituent *2D Constellation that is partitioned 

0 four, equal subsets^ each corresponding to a coset of 

Since the normalized = redundancy, of this code is only 
2, the constellation* expansion ratio is lowerbounded by 
-. To sehdi l4-bits;per L four,dimensions, for example, 

need a constellation' of only.2!, 5 points from a coset of 
, rather than the 2[ 4 ipbints required for an uncoded sys- 
n. The minimum* possible,' size of 4 the constituent 2D 
nstellation is. thus 'ohly;2 7 t * .181 points, compared to 

= 128 with no /coding. Indeed; as was shown in [2], 
das will be recapitulated below, there is a "generalized 
oss constellation:.' Crthat Jcan .be used with this code 
nose constituent' 2D constellation has only |C 2 | = 192 
.ints, sothatCEROC) = \ C 2 \/2* t = 192/2 7 5 = 1.06. 

Compatibility with t QAM Modems II: Phase 

In QAM modems, .a carrier phase offset of 0 in the re- 
iver rotates the received constellation by a phase angle 
TKe receiver may,, and generally does, track the carrier 
lase directly.from the data ; signal, rather than from some 
.rrier tone. This makes two additional characteristics de- 
rable., First, ,there;should. be no point in the constituent 
[) constellation at the. origin, because such a point gives 
oinformation < about carrier phase.: (This consideration 
*es force for large < constellations, where the probability 

1 long runs, of signal points at the origin becomes negli- 
ible.) Second, the constituent 2D constellation should be 
ten to be invariant to as many phase rotations as pos- 
ibie, because the phase tracker may then quickly con- 
erge to steady state, and a differential coding technique 
lay be applied to guarantee transparency to phase ambi- 
.uities, as shown, for, example, in [7] and [2]. (In addi- 
,on, such phase symmetry assures that the mean signal is 
era and therefore that there will be no carrier component 
i the transmitted signal.) .;. 

For example, if the constituent 2D constellation is to 
x drawn from a coset of Z 2 , the only coset . that has no 
oint at the origin and has 4-way (quadrilateral) symmetry 



is the half integer grid Z 2 + (1/2, 1/2), illustrated in 
Fig. 3(a). The grid Z 2 + (1/2, 0) that is used in Fig. 
3(b) has only 2-way (bilateral) symmetry. 

Note that if the half-integer grid Z 2 + (1/2, 1/2) is 
partitioned into four cosets of 2Z 2 as in Fig. 3(a)— 
i.e., A = 2Z 2 + (1/2, 1/2), B = 2Z 2 + (-1/2, 1/2), 
C = 2Z 2 + (-1/2, -1/2), and D = 2Z 2 + (1/2, 
-1/2)— then a 90° rotation cycles A into B, B into C, C 
into D, and D into A. Therefore if the constituent 2D con- 
stellation is based on the half-integer grid and has 4-way 
symmetry, then it automatically divides equally into these 
four cosets of 2Z 2 ; conversely, if C 2 is constructed from 
all of the 90° rotations of any set of points from any of 
these four cosets, it will automatically have 4-way sym- 
metry. 

In summary, for use in QAM modems with a code C 
based on a partition A/ A' of binary lattices, the constit- 
uent 2D constellation must be a set of points from a coset 
Z 2 + a of Z 2 , preferably the half-integer grid Z 2 + ( 1 /2, 
1 /2), divided evenly as in Ungerboeck [6] into 2 M sub- 
sets, where \x. is the depth of the partition A/ A', and where 
the subsets consist of points from the 2* cosets of R^Z 2 
whose union is Z 2 + a. To send n information bits per Af 
dimensions, the size \C 2 \ of the constituent 2D constel- 
lation is lowerbounded by 2* + ' (c)where '' " 2 n/N, and p(C) 
is the normalized redundancy of the code 0. 

We may note that hexagonal constellations, with points 
from the hexagonal lattice A 2 , have also received some 
attention and have some modest advantages, although they 
have rarely been implemented in practice (but see [8]). 

F. Scalability 

Scalability is a property that is nice to have, although 
it is not necessary. Roughly speaking, a constellation C 
is said to be scalable if there is an infinite series of con- 
stellations C, C, C", • • • of increasing normalized bit 
rate (3 such that the properties of all constellations in the 
series are essentially the same as those of C, when ad- 
justed for /3. In this section, we shall merely introduce this 
idea with a simple illustration. 

Consider any two-dimensional constellation C based on 
the half-integer grid Z 2 + (1/2, 1/2). We may consider 
each point in the grid to be surrounded by a square of side 
1 centered on that point. This defines a region (template) 
R that consists of | C | such squares, and that has volume 
V(B) = | C |. For example, the regions corresponding to 
a 4 X 4 square constellation and the 32-point cross con- 
stellation of Fig. 3(a) are shown in Fig. 4(a) and (b). 

Suppose now that we subdivide each square into 4 
squares. This defines a new constellation C based on a 
scaled half-integer grid that, compared to the original 
constellation, has | C \ = 4C, d 2 min (C) = rfL(C)/4, 
and P(C') ^ P(C). Thus, the constellation figure of 
merit is reduced by a factor of 4, as we expect when | C \ 
= 2 & increases by a factor of 4. If the original constella- 
tion has quadrilateral symmetry, then so does the new 
constellation. 

By iterating this process, we can define a series of con- 
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(a) 4x4 square 



(b) 32-point cross 



Fig 4 Templates for 4 x 4 square and 32-point cross constellations: (a) 
4X4 square, (b) 32-point cross. 

stellations whose normalized bit rate increases by 2 bits 
per two dimensions at each step. The families of square 
and cross constellations can be generated in this way. (In 
the square case, we can start with the 2 X 2 square con- 
stellation.) ,. ., . 

It is clear that as | C | goes to infinity, the distribution 
of constellation points approaches a uniform distribution 
over R. In this sense, the continuous approximation is 
asymptotically exact. 

If we want to increase the normalized bit rate by 1 bit 
per two dimensions, we can put two points along a diag- 
onal of each square, and thereby double | C | while halv- 
ing dmin(C) and thus approximately halving CFM(C). 
For example, the 32-point "square" constellation of Fig. 
3(b) can be obtained from the 4 X 4 square template in 
this way. The points so obtained are in general on a scaled 
translate of RZ 2 rather than of Z 2 , and have only bilateral 
symmetry, even if the template has quadrilateral sym- 
metry. This is why different families (e.g., square and 
cross) are ordinarily used for even and odd normalized bit 
rates 0. 



G. Peak-to- Average Power Ratio 
[ A measure of the sensitivity of a signal constellation to 
nonlinearities and other signal-dependent perturbations is 
the peak-to-average power ratio (PAR), defined as the 
ratio of the peak energy of any signal point to the average 
energy or power. With N dimensional constellations, there 
is some question as to whether to measure the PAR in N 
dimensions, or in some lower number of dimensions With 
QAM modems, the PAR is usually defined in two dimen- 
sions, as the ratio of the peak energy of any signal point 
in the constituent 2D constellation C 2 to the average power 
(per two dimensions) P(C ). However, depending on the 
application, the value of the PAR in some other number 
of dimensions may be as or more significant. When nec- 
essary, we shall write PAR 2 or PAR* to distinguish the 
PAR of C 2 in two dimensions from that of C in N dimen- 

S1 °/Vdimensional cubic constellations may be regarded as 
the (N/2) fold Cartesian product of two-dimensional 
square constellations (for N even) . The peak energy of an 
M X M square constellation based on the half-integer grid 
Z 2 + (1/2, 1/2) is (M - D 2 /2 = 272 where 0 is the 
normalized bit rate per two dimensions Thus the PAR in 
two dimensions goes asymptotically to 3. In fact, the PAK 
of cubic constellations is asymptotically equal to 3 in any 
number of dimensions. 



The idea of an opportunistic secondary channel isju 
follows. Suppose that the primary data source generate^ 
b bit data word for every N dimensional signal point to- 
transmitted. Choose an TV dimensional signal constellap 
with size I C I greater than l\ and let certain of the data 
words be associated with more than one point in the w 
stellation. Whenever one of these words happens to 
generated, there is an opportunity to transmit addition 
side information, by choosing one of the ; two or more* 
nal points associated with the data word: For example 
i C i = 2 b + w, then we can associate 'a 1 pair of !sj 
points with w of the data words; with probability. ^ 
an additional "secondary channel bitV-can then ^e.tia 
mitted along with the b bit primary data word by^in 
N dimensional signal point, by 'selection- of Jonejot 
other of the pair of signal points associated-with that 

W °Of course, increasing the size | C | of the constella 
will normally somewhat increase its average po\yer 
degrade its SNR efficiency. As we shall see in some 
the examples below, the average data' rap ofUielof) 
tunistic secondary channel maybe? substantially^ 
than the increase in normalized bit 'rate! that; could) 
narily be obtained by the increase in average powej. 
other examples, the use of an opportunistic seep 
channel allows an increase in normalized bit rate of* 
fractional number of bits /Hz.isuch :as .might bt f 
for a low-speed control channel ^withdittle or no p 
in average power. These benefits are offset byitheA 
the transmission of secondary channeWata^s probatj 
tic so that secondary Channel data : ?must*beVquea 
buffers for transmission. Also, there, arises.^he R91§ffi 
of insertions and deletions' of secondary, channe 
above and beyond ordinary! errors,\However, .foc 
types of data, such as internal' signaling(andJcontro 
these are 1 perfectly acceptableUradeoffs#and^ 
ability of secondary 'channel' dataVto).Mpiggybac^ 
primary data channel with" little or'no degradation 
efficiency is a highly desirable feature^%y 

The expanded constellations fused^withKoppo 
secondary channels must 'retainithetdesirablejcljaa 
tics listed above; To minimize' degradatiorf.oPSM 
ciency, the points to be paired shouldlcomelfromUi 
points of the constellation^ and > ^'additional 
should have the smallest possible 'energy:, tfhe- ma 
secondary channel bits to signal points' should ,\» 
to implement. With coded modulation/ the two^ 
signal points associated with a' given' data word? 
belong to the same subset (coset;of;.A.'),!lf<,qua 
symmetry is required/ then ; the/expanded'.cons 
should be quadrilaterally symmetriC'and generalJ 
rotation of the set of signal points associated withto* 
word should be a set of points. ' associated ^ 
data word. ; ' •■«••• ■ ■ ... 

The opportunistic secondary channel idea seems 
been first embodied in the Codex/ESE SP14.4> 
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|8] and subsequently to have been reinvented more 
act [9], [10K [111; see also [12],,. 

ni. -Shaping and Coding 

s sectioii^^^^^ 1 that fora larg t e 
at oh C with^ndfhAaiiied; bit ratio ft that consists 
^ „ts f^aH^e^;+ a of a lattice A that fall 
„ a regions; the cbfistellation figure of merit is given 
,ximately by : the product ^ 




"^^SS^Afakd tf(ff) is the shape 
oft»^tegl6h'R!>iT<tf^the-gain'trom shaping by the 

, d $ ^«^toA^fetorthe.coaing gain y e (C) 



,orf#t&'p«^V»«f'*^* **> •* analogous. 

.steiia'donV.^ of T n 

imWmrm-^mmmn expansion rauo 

irAl^thS'la^^if/^ generally, of a code 

I ; arid the shaping expansion ratio CERj(R) of the re- 

\V^Mve an expression for the peak-to-average ratio 
tHe constituent 2D Constellation 1 of C that is a function 
dgfriR. and.consists,of the ; product of three terms: the 
;&average rati 0i of;the;constituent 2D region R 2 of 
^latio of the shape gain of R to that of R 2 , and he 
^^mdh tx^oti ratioCER;(R), with the 
jst^ually^being dominant. . t ■ 

iffl^e Gain and Coding Gain 
Ij^metnia for constructing an N dimensional 
mffl$A is to, take ,the points of some N dimen- 
^fetice^ (or.of a .translate ;A,+ a of A) that lie 
tomelegion R of N,space,.with the region chosen 
te enough to -enciose'.the desired number | C | oi 
feints>Such a' constellation: is also called a lattice 
^shall sometimes denote such a constellation by 

WmS geometric parameters of a lattice A are 
*«e; ( minimum. squared distance between its 
m iii'juhdamental. volume K(A), which is the 
TO'spabe corresponding tb each lattice point. (In 
S iMAj,is : ;any fundamental region of A-i.e., a re- 
fcimn'g bne;'and. only one point from .each coset 
HE;in.reV space <-then the volume of R A 
^ndW(space is, tessellated by the translates R(A) 
fe) ts)x ranges through all points in A. ) 
^estimate 'the parameters <J mi „(C), | C | , and 
^ constellation C(A, R) as follows. First the 
mw'squared distance of C(A, R) is d min (C) 
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d 2 • (A) If R is large relative to V( A), then we have as 
a first approximation principle the following proposition. 

Proposition 1: The size | C | of any constellation C A 
R) consisting of the points in any trans ate A + a of any 
lattice A that lie within any region R of volume K(R) is 
J "xfmately |C| » K(R)/K(A) if K(H) . much 
larger than the fundamental volume K( A) of A 

The normalized bit rate p = (2/N ) log 2 | C | of C( A. 
R) is therefore given approximately by ft - log 2 
[K(R)/K(A)f' w . The baseline constellation figure of 
merit for this fi is CFM©( 0) = 6/2* - 

6 To ( S!heaverage power P(C), we may use the 
following old, convenient, and quite accurate approxi- 
mation principle (called the "integral approximation in 
Til! but perhaps better called the continuous approxima- 

UO Proposition 2 (Continuous Approximation) : The aver- 
age power P(C) of any constellation C(A, R) consisting 
of the points in any translate A + a of any lattice A toat 
lie within any region R is approximately equal to the ave 
age power P(R) of a continuous distnbution that is uni- 
form with R and zero elsewhere 



P(C)-P(R)A[J.lr| a ^]/[W(R)/2] 

where II r f is the energy of a point r in R and K(R) = 
tdvlL volume of I, so that 1/K(R) is the density 
of a uniform continuous probability distnbution over R, 
and the division by N/2 normalizes to two <bmennoaa^ 
For example, in one dimension, if R is a line ^segmen 
of length R centered at the origin, the average energy is 
P 2 /12 in one dimension, so the average power is P(H) 
= R 2 /6 per two dimensions. In N dimensions the aver- 
age energy of a region R which is an N cube of side R is 
Ntimes the average energy of one coordinate so the aver- 
age power per two dimensions remains P(R - * l /t- 
The volume of such a region is V(h) - R , so r(W) 

^SLstellation figure of merit of any C(A . Rj > is 
therefore approximated by CFM(C) = d mi „(A)/P(R)- 
The gain over the baseline figure of merit is 

CFM(C)/CFM®(p-) 

= [dUA)/v( a) 2/ "] • [n*) 2 "7 6P ^-. 

Here we have grouped the gain into two term. Jjhtehde- 
pend only on A and on R, respectively. The first term is 

7.(A) A d 2 min {A)/V(\f /N , 
which we call the coding gain of the lattice A (i.e., the 
'fundamental coding gain' [Forney, 1988] of the lattice A 
asalattice code). Note that 7c (Z N ) = 1 . The second term 
is 

7j (R) i V(R) 2/W /[6P(R)]. 
which we call the shape gain of the region R. Note that 
the shape gain of an N cube is 1 . 
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..The baseline figure of merit CFM e (/3) is approxi- 
mately the figure of merit of a cubic constellation with a 
normalized bit rate of /3, i.e., of a lattice code C(A, R) 
in which A is the integer lattice Z N and R is an TV cube. 
We see that by drawing points from a lattice A that is 
denser than Z N , we can improve SNR efficiency by a fac- 
tor of 7 C ( A), the coding gain of A. By choosing the con- 
stellation from a region R that is more spherical than an 
N cube, we can improve SNR efficiency by a factor of 
Y,(R), the shape gain of 111 To the accuracy of our ap- 
proximations, these two improvement factors are quite in- 
dependent. 

Similarly, if we use a coset code C(A/A'; E) with a 
constellation C(A, R) consisting of the points from A + 
a that lie within R 9 then we can show that the gain over 
CFM e (/3) is given approximately by the product y c (C) 
' 7,(R)t if tne coding gain of C is defined by 

7c(C) i4(C)/K(e) 2/N 

where dm in (C) is the minimum squared distance between 
sequences in <S, and K(S) is the fundamental volume of 
C. (In [4], it is shown that V(C) = 2 r K( A), where r is 
the redundancy of E, or V(C) 2/N = 2 p(c) where p{ C) is 
the normalized redundancy of C.) Thus, the separability 
of coding gain and shape gain extends to general coset 
codes, not just lattice codes. 

B. Shape Gain and Normalized Second Moment 

The "dimensionless second moment" [13] or normal- 
ized second moment [14] of a region R is defined as 

G(R) 4 P(R)/[2V{R) 2/N ] 



-[t |r| a ^]/[i^(H) l+<aW 



bounded by an N cube of the same value. We shallot 
give its value in dB. : w i:r 

C. Shaping and Coding Constellation Expansion Rat 

If an N dimensional constellation C ( A, R ) | .consist 
the points of some TV dimensional lattice A J or of a7" 
late A + a of A, that lie within some. region^ of ATs 
then the constituent 2D constellation C' 2 of [CiAff^ 
sists of the points of the two-dimensional projectionJA 
A, or a translate A 2 + a of A 2 , that lie within the 
dimensional projection R 2 of R. If A 2 and R 2 do'nof 
pend on which two dimensions we pick,' then ? 
and R ID-symmetric, and we may speak of A 2 as ^ 
stituent 2D lattice of A^.and of J^ 2 .as the constituent 

region of R. r ...... . ji ' Ju .^'m* 

It is easy to verify that A 2 , is in fact a twb-dimensi 
lattice, since the sum and difference of any two ppw 
A 2 must be in A 2 . Also, A must be a .sublattice ofj Aj) 
If A is a sublattice , of Z N y then A 2 is a sublaUice^ 
We usually use lattices A' for which '•A'i'^j.Z^'TK' 
malized redundancy of a binary lattice is defined.aSj 
4 (2/N) log 2 |(A 2 )^ 2 /A| = lo^ Lpf(^) 2/ ^^p 
or, if A 2 = Z 2 , simply p(A) = lpg 2 ^A) 2 /^ 

By Proposition 1, | C | = .K(P)/^(A^.The f jii^ 
constituent 2D constellation, by the^same apprpxiriia 
is|C 2 | = K(K 2 )/K(A 2 ); Thus" ^C^fccjflff 
imately , , . ..... . ... .- n ,, r , Vt , , Jn ,,,..-,; ;; ,U H 

CER(C) = iQl/ICl 2 ^ = [K(X) 2 ^/y(A^ 



]• 



Thus the shape gain is simply y s (R) = 1/[12G(1F3)] = 
Ge/G(R) where the baseline second moment G® = 
1/12 is the normalized second moment of any N cube. 
Thus, the shape gain is a measure of the improvement in 
normalized second moment relative to an N cube, and nu- 
merically is simply a constant multiplied by the inverse 
ofG(R). '. .. 

The shape gain 7j(R) [or equally the normalized sec- 
ond moment G(R)] has the following properties (com- 
pare the properties of the coding gain 7 C ( A) or of 7 C (0) 

[4]) * 1,N 

a) It is dimensionless; both P(R) and V(Rr /iW have 

the dimensions of a two-dimensional volume (area). 

b) It is invariant to scaling: 7*0^) = 7*( R ) where cc 
is any scale factor. 

c) It is invariant to orthogonal transformations of R. 

d) It is invariant to the Cartesian product operation, 
y s (R M ) = y s (R) because P(R M ) = P(R) and V(R M ) 
= V(R) M . 

e) By definition, the shape gain y s (R) is a power mea- 
sure; it measures the reduction in the average power (per 
two dimensions) required by a constellation bounded by 
R compared to what would be required by a constellation 



This is the product of two independent 'factors: the 
constellation expansion ratio 'of A, defined as 0 tf 



CER r 



(A),:4^(A)^ 

and the shaping constellation ? expansion" ratio' 'off 
fined as . " ? '-V . ■• >fc ^Z-'^^i 

Note that CER C ( Z N ) ' = *1 , and ;CER^( R 
N cube. Again, as with the constelktion .figure^of 
the coding constellation expansion raho v GER?(f/j?J 
the lattice A and the shaping ;^cpnsteU^ibn^ex 
CER,(R) due to the region' R 1 arc r com'ple?ely^ecW 
to the accuracy of the approximation 1 of Proposition 

The baseline size t \C \ 2/N =;2^ is ;apprpxim^ 
size of the constituent 2D constellation if -we^sen 
per two dimensions using : a constellation • 
translate of Z N and shaped like 1 an N cube^in^wliic 
A 2 == Z 2 . If we use a lattice A that isVstrict'subl 
(A 2 ) A//2 , we suffer a constellation expansion^ofia 
of CER C (A): By choosing the constellation^ffohVa 
R that is a strict subregion of (R 2 )^ /2 ;Ve*suffer a 
constellation expansion of a factor of CER 5 (I^)) 
are costs associated with achieving the lattice 'gain 
and shape gain y s (R). % ^ vfM 

When Ax is a 2D symmetric ' binary'^ latti 
CER C (A) = 2 P(A) where p(A) is' the normaliz 
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of A. Similarly, for a coset code C based on a par- 
of binary, lattices, the coding constellation expan- 
atio is CER c (p) i = > 2 p f c) where p(G) is the nor- 
:d redundancy of d [4].' It may. be shown that if the 
Elation used with such a code is bounded by the re- 
1, then it remains true that 

v = 2n //V is'th'^ bit rate of 

e pea&t^ a constellation may 

be^estiniated' by 1 . the^contiriuous approximation, al- 
■h witti less accuracy wtfen the points in A are at some 
nee from the^^ the bound- 

if R. In N dimensionslfthe' peak energy r 2 m&x (C) is 
rbounded >arid'>approximated ; by. the ..peak, energy 
< R) 1: of anf ppini. in'the 4 region . Similarly ^ in two 
nsions! r*&(Cit& upperboiinded and approximated 
J BX (B 2 )i The average power;per two dimensions in 
r case is P( f C), || whichRisJapproximated by P(R). 
. efore P AR 2 ( C ) Is Approximated by 

Aen;by itself Z the! region -Bp has a peak-to-average 

erratioof^^K^no^-^n'i! 'U-'>: : -.':^ ^ '*! ' - 

re!i? (K 2 V is tthe average power under a uniform dis- 
jtion over U 2 : But the;shape gains of R and R 2 are 

7 (R) = y(R) 2/ 7[6P(R)], and 

/T if'-i%hi.".. uitfj •» v r/, * 1 ' ■ ' 

^I^(P2) : ==^(^2)/[6P(^)], . 

lectively^Therefore have' the useful relationship 

•4PAft,(Ri). ,i [*(n>/7i(Ri)] • , CERi(IS). (*) 
in words 1 , ,the peak-to-average power ratio in two di- 
nsions'is tht'producV of three factors: the peak-to-aver- 
: ^we^ratio.of tfi 2D region R 2 , the im- 

►vement^ of the N dimensional region R 

cr^at':pf jRi t k .and the constellation expansion ratio 
iRi(^tof the region R Since an improvement in shape 
in is one df^the purposes of going to N dimensions, this 
nmil^ of regions R* which have small 

tt^ii^ion^expansion ratios and which have constituent 
)^M^icfi;themsclvcs have a good PAR. 

fij •> V ' iVV f Bounds ; on ' Shape ' Gain 1 
In this section we^rst compute the shape gam of the A/ 
hcrcXwhich is the largest possible shape gain of an N 
mcnsional region KL.Wenote, however, that the shap- 
g wniteUation expansion ratio and thus PAR become 
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large as N increases. The best tradeoff between shape gain 
7j (R) and CER,(R) or PAR(R) is obtained when we 
bound the peak energy of the constituent 2D constellation 
within a circle, and we evaluate this optimum tradeoff. 

A. Spherical Constellations 

For a given volume K(R), the N dimensional region R 
that has the minimum average power P(R) and thus max- 
imum shape gain 7,(R) is obviously an N sphere. The 
volume K® and average power P® of an N sphere of ra- 
dius R are, for N even 

7® = W/lnl]; 

P® = R 2 /(n + 1) 
where (in this section only) we define n = N/2, thus, 
the shape gain of the N sphere is 

7® = (V*) l/H /[*P*] = *(" + l)/[6(n!) I/n ], 
while the shaping constellation expansion ratio and peak- 
to-average power ratio are 

CER® = (irR 2 )/(V*) }/n = (n!) ,/n ; 

PAR® = R 2 /P® 

= (6/tt)7® ' CER(g>, 

again with n = N/2. Table I gives y®(N) 9 CERg^N), 
and PAR® (N)fov reference. 

As is noted in [1], using the Stirling approximation n\ 
= (n/e) fl , we can see that the shape gain 7® (AO ap- 
proaches ire/6 (1.53 dB) as N goes to infinity. 

The N sphere has the optimum shape gain in N dimen- 
sions and thus the optimum SNR efficiency. However, 
implementation complexity may be high. Also, as we see 
from Table 1, the size and peak-to-average power ratio of 
the constituent 2D constellation become large as N in- 
creases. The main object of N dimensional constellation 
design is to obtain a shape gain as close to that of the N 
sphere as possible, while maintaining reasonable imple- 
mentation complexity, constituent 2D constellation size, 
and other desirable constellation characteristics. 

The peak-to-average power ratio (PAR) for the N sphere 
illustrates what may be expected in general for N dimen- 
sional constellations with good shape gains. The peak en- 
ergy in N dimensions is R 2 , so the peak-to-average ratio 
PAR/v = R 2 /nP® in N = 2n dimensions is (n + \)/n 
and approaches 1 as N becomes large (as the volume of 
the sphere becomes concentrated near its boundary). 
However, the projection of an N sphere of radius R onto 
two dimensions (the boundary of the constituent 2D con- 
stellation) is a circle of radius R, so the peak energy in 
two dimensions is also R 2 ; the peak-to-average ratio 
PAR® = R 2 /P® in two dimensions is thus n + 1 = (N 
+ 2)/2, and approaches infinity as N becomes large. As 
shown in Table I, for N = 2, PAR® is only 2, better than 
that of the square, but for N = 4, PAR® is already 3, 
equal to that of the 4-cube, and for larger N, PAR® 
quickly exceeds the PAR of cubic constellations. 
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TABLE 1 

Shape Gains and Constellation Expansion Ratios of N Spheres 



U 

2 
4 
8 
12 
16 
24 
32 



ic/3« 1.047 0.20 

n/2 372 = 1.111 0.46 

1.183 0.73 

1.224 0.88 

1.252 0.98 

1.287 1.10 

1.309 1.17 



1 

1.414 
2.213 
2.994 
3.764 
5.289 
6.800 



2 

3 

5 

7 

9 

13 

17 



The relation PAR®(/V) = (6/ir) 7 ®(AOCER®(/V) is 
an instance of the general relation (*), since PAR® (2) - 
2 and 7 ®(2) = tt/3, and shows that the main contribu- 
tion to the increase in PAR® (N ) as a function of N is the 

increase in CER®( AO- 

For example, to send 0 = 7.5 bits per two dimensions 
using a spherical constellation of 2 15 points from Z .Oie 
constituent 2D constellation contains about 2 -z 
256 points and has a PAR of about 3. To send 0 = 7.25 
bits per two dimensions using a spherical constellation ot 
2 29 points from Z 8 , the constituent 2D constellation con- 
tains about (24)'/ 4 • 2 29 ' 4 « 337 points and has a PAR 
of about 5. As shown in [1] and [2] and recapitulated be- 
low, there are easily implemented "generalized cross 
constellations" for these two cases with 192 and 60 
points, respectively (compared to the baseline sizes of 181 
and 152 points, respectively), with PARs close to 2, and 
with shape gains of the order of 0.3 dB. 

Note Sloane [15] calls the set of points of an N dimen- 
sional lattice that lie within an N sphere a "code of 
bounded energy," employs the continuous approxima- 
tion and gives a bound on the approximation error due to 
Walfisz. Calderbank and Sloane [5] generally use spher- 
ical constellations with their codes, and include the N 
sphere shape gain in their figures for total coding gain, 
with asymptotic bounds. 

B. Bounds on Shape Gain Versus CER S and PAR 

We implicitly assume throughout this paper that the 
probability distribution on the 2 b points in an N dimen- 
sional constellation C is uniform. However, the induced 
probability distribution on the points in the constituent2D 
constellation will be nonuniform in general (unless C is 
the Cartesian product of constituent 2D constellations). 
Indeed, with spherical constellations, this probability dis- 
tribution becomes Gaussian as N becomes large, as we 
shall shortly see, so that the frequency of occurrence of 
the outermost points in the constituent 2D constellation 
becomes small. This suggests that the peak energy R of 
the constituent 2D constellation may be bounded to some 
extent without greatly affecting the sphericity or shape 
gain of the N dimensional constellation. 

In this section, we develop bounds on shape gain for 
such constellations, which show that substantial shape 
gains can be achieved even when the peak-to-average 
power ratio (PAR) and shaping constellation expansion 
ratio (CER f ) are kept within reasonable limits. 



SELECTED AREAS IN COMMUTATIONS. VOL. 7. NO. 6. AUGUST 1959 

Letp(*) be the probability distribution on the points, 
in the constituent 2D constellation C 2 induced by tip 
form distribution on the 2 b points in C; we .assume^ 
simplicity that p{x) is identical for all 2D -wnsutu: 
constellations. Let • ' , ,;,;„ ; j '^'^ 

H{Ct) * -Sp(*)iog2P(^Kj 

be the entropy (in bits per .two dimensions) pfjlusTtw 
dimensional distribution. By a basic information-theo 
inequality, (JV/2>*<C a ) * «<C) =. b, the MM 
the uniform N dimensional distribution over the 2. u p 
in C, with equality if and only if the N dimension^ 
tribution is the Cartesian product of N/2 two-dimerg 
distributions p(x). Thus', the, entropy of the ^istributt 
p(x) is lowerbounded by the normalized bit rate/' 

H(C 2 ) >-2b/N-= : &.^'\ «T"' 

Under the same assumptions, the average energy^ 
N dimensional constellation Cu-Nfl^^^S* 
energy • V,,v;.v ' \i ■? 'i 



of the constituent 2D constellation, . and the.average ps 
P( C ) per two dimensions is equal to P(C 2 ). ,J '& 

For large constellations, .we rmay again.=resort|Us 
continuous approximation. The continuous pro 
distribution p(x) that. maximizes ttie. differential. en 
H(X ) for a given average power P(X), or that mi 
P(X) for a given H(X), is a two-dimensional! Ga 
distribution .•S-,,Hi / .r.v^.;»u|l^?^ ) | 
p(x) = (2^ 2 )- , exp^||x|fy?a 2 .^ 

The average energy P(X ) per two dimensions is tvd 
one-dimensional variance' a 2 ;<and the differentiate* 
H(X ) per two dimensions is log 2 .2ir eo^-M $ IS $M. 
are taken from a coset of the integer lattice Z. the 
point is associated with a ; unit; volume- of ; tw. 
( K(Z 2 ) = 1), and the entropy H(>C^ ofodigcs 
ability distribution on these points that approxima^s 
is also approximately,^ 2 7 ea 2 , ,H S nce^it tekesjn: 




mensions uuu uw,»o -- yw/i^v >,w 

baseline average power .for this :bit;rate,is ( 7|® > 4.fl 
2"< X) /6 = 2wff 2 /6;.,therefore'the f shape / ,ga 
Gaussianp(*)'is «e'/6,h lAp^m^. 
to a proof that me maximum possible . shape;gam_ 
N dimensional constellation.;G;issl.: ( 53,.dB i ; fr an(J 
equality holds when and only when.the- induced 
tion p(x) is Gaussian, and'ra.! spherical; cons 
achieves this shape gain: as J^J^^WW 
shows that the induced two-dimensional distnbuU 
spherical constellation becomes Gaussian as N f * 
Suppose now that we require that p(x) be, zero 
a circle of radius R. By standard variational;* 
using a Lagrange multiplier X, the probability distn 
p{x) (discrete or continuous)' that maximizes the 
H{X ) for a given average power P (X ),'.or that , 
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>r a given H(X)\ given this constraint, is a two- 
.mal truncated Gaussian distribution 
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/*(X^ 



jell 2 > R 2 



. is a parameter tiia(^ 

age power P(X) arid the entropy H(\), and K(\) 
fficient" chosen to ; make u p (>) = 1. For X = 0, 
uniform within jtHe- cifcle^of radius R\ as X -> oo, 
iiproachcs^a !ttauss^ with variance 

,i using the continuous approximation, we can de- 
/f(X);;P(X);;and}W form as fol- 



8 i.o 




Fig. 5. Best possible tradeoff between shape gain 7, (IS) and shaping con- 
stellation expansion ratio CER S (I3). 



/,(X) and •> 2 (XCare"giVen- byV<,v.v ^t? ... 

peak-to-awratep^ 20 
llation and' the shape gain afe.th'en' given paramet- 

b y.. s-.yji^.'.i ^rfi : <ii''.'ji'. : 

AR(X) =^/P(X)i=^X7i(X)/J 2 (X); . 

-.■>! j. • U; 'li'.'.vV,.'..o:;i 'V.'- •' - '" 
7j (X) = P 9 [H(\)]/E{\) . • 

-;f*>^?^$)l e *i> A(x)/i,(x). 

arse, the size of the constituent 2D constellation is 
ihe constellation expansion ratio relative to the base- 

ize of^^Ms I'^M^ij^i'r /; " -v : ^ • 
CErM=1 exp i J 2 (X jjtt,(X j. 

ain, the relation between these three quantities, 

tarticular case;bf the general relation (*), which can 
lerpreted as follows; The PAR is greater than its min- 
11 value of 2 = PAR®(2) = (6/x)y«(2) b Y the 
act of the, (desirable) .factor y s (\)/y® (2) with the 
osirable) factor CER,(X). 

>r \ = 0, p(x) becomes uniform over the circle of 
is/?, wAM0h-*/*R\ PiO) = R 2 /2, H(0) = 
i*\.PAR(0) « 2,. 7 ,(0) = t/3 = 1.047 (0.20 
. and CER,(0) = ,1 , as we have previously calculated 
two-dimensional 1 spherical constellations; As X in- 
sevPAR(X), CER-(X);. and y s (\) increase. The 
coff between CER;(X) and 7,(X) is illustrated in Fig. 
he tradeoff between PAR ( X ) and 7, XX), in Fig. 6. 
.'e see that the. shape gain can be as large as 1 .20 dB 
e allow the two-dimensional peak-to-average power 
,>to increase to 3 (the same as the PAR of a square 
illation); this PAR is the product of the circular con- 




2.0 2.5 3.0 3.5 4.0 4.5 5.0 
PAR 

Fig. 6. Best possible tradeoff between shape gain y s (B) and peak-to-aver- 
age power ratio PAR ( U ) . 

stellation PAR, namely 2, times a factor of about 1.25 
due an increase of 1 dB in shape gain, times the CER 
factor of about 1.2. The shape gain can be as large as 1.50 
dB, effectively optimum, if we allow the two-dimensional 
peak-to-average power ratio to increase to 5; this PAR is 
2, times a factor of about 1.35 due to an increase of 1.3 
dB in shape gain, times the CER factor of about 1 .85. As 
X- oo./rfX)- hhM^ 1,PAR(X) oo,CER,(X) 
- 00, and 7j (X) - ire/6 = 1.423 (1.53 dB), as we 
already know. 

On the other hand, given a two-dimensional distribu- 
tion p(x) with average energy P(X) and entropy H(X), 
it follows from fundamental theorems of information the- 
ory that, for N large, the (N/2) fold Cartesian product of 
p(x) with itself is an N dimensional distribution, under 
which there are a set of approximately 2 NH(X)/1 "typical 
sequences" (N tuples), each of probability approxi- 
mately 2~ AW(X)/2 , that account for nearly all of the total 
probability measure on N tuples. Taking these "typical 
sequences" as the coordinates of N dimensional signal 
points, we arrive at a constellation Cthat represents nearly 
H(X ) bits per two dimensions with average energy P(X ). 
Thus, the tradeoffs shown in Figs. 5 and 6 are effectively 
achievable, if we are prepared to consider arbitrarily large 
numbers N of dimensions. 

We conclude that it is possible in principle to achieve 
shape gains of more than 1 dB while keeping the peak-to- 
average power ratio of the constituent 2D constellation 
below 3, with the constellation size no more than 1 .2 times 
the minimum possible (for such a shape gain), or, to 
achieve effectively optimum shape gains (1.5 dB) with 
PAR no greater than 5. In order to achieve these results, 
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the induced two-dimensional distribution p(x) must ap- 
proximate a Gaussian distribution, truncated within a cir- 
cle. 

V. Generalized Cross Constellations 
Cross constellations are practical 2 fr -point two-dimen- 
sional constellations based on the half-integer grid, and 
have been incorporated into international standards for 9.6 
kbit/s dial modems (CCITT Recommendation V. 32) and 
14.4 kbit/s private-line modems (CCITT Recommenda- 
tion V.33). Extensions of the cross constellation con- 
struction have been previously published in [1] and [2). 
In this section we review previous constructions, describe 
a general construction for sending any integer number b 
of bits per N = 2 m+l dimensions, and show that the re- 
sulting generalized cross constellations have moderate 
shape gains (of the order of 0.3 dB), very small shaping 
constellation expansion ratios, and other desirable 
practical attributes. Such constellations. have in fact been 
used in Codex 19.2 kbit/s modems, in conjunction with 
multidimensional trellis codes [2]. 

A. Cross Constellations 

A * 'cross constellation" is a simply constructed 2 fc -point 
two-dimensional constellation that asymptotically 
achieves a shape gain of 32/31 (0.14 dB) relative to a 
square constellation with the same number of points [1]. 

To construct a 2 54 * 2 *-point cross constellation, we use 
the 32-square template U of Fig. 4(b), and scale (by par- 
titioning each square into 4 squares) k times; the constel- 
lation is then the set of 2 5 + 2 * half-integer grid points lo- 
cated in the middle of the resulting squares. (For a 
2 4 + 2k point constellation, we simply take every other point 
from the 2 5 + 2k point cross constellation in checkerboard 
fashion; the resulting constellation has points from the 
grid RZ 2 + ( 1 /2, 1 /2 ) and has only two-way rather than 
four-way symmetry.) 

The template R of Fig. 4(b), which has the shape of a 
cross, has V(B) = 32 and P(R) = 31/6, so y s (R) = 
K(R)/6F(R) = 32/31 (0.14 dB). Therefore, asymp- 
totically, cross constellations achieve most of the 0.20 dB 
of shape gain that could be obtained by circular constel- 
lations in two-space. The 32-point cross constellation of 
Fig. 3(a) has average power P(C) = 5 (6.99 dB) [com- 
pare the continuous approximation P(R) = 31/6 = 5 
1 / 6 (7. 13 dB)]; the 32-point square constellation of Fig. 
3(b) has average power P(C) = 63/12 = 5.25 (7.20 
dB), which is 0.21 dB worse. [The baseline average 
power is P©(5) = 32/6 = 5.33 (7.27 dB)]. 

To the accuracy of the continuous approximation 
(which is not too good until 2 b becomes quite large), the 
peak energy of R is 13, so the peak-to-average power ra- 
tio is asymptotically 13/(31/6) = 78/31 = 2.52, about 
halfway between the square and the circle. (The constel- 
lation expansion ratio is of course 1 because a cross con- 
stellation C is two dimensional. ) Implementation com- 
plexity for cross constellations is trivial. Cross 
constellations of 32 or more points divide evenly into 4 
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or 8 subsets corresponding to cosets of 2Z 2 or 2/?Z 2 ,.and* 
those of 128 points or more divide evenly. into 16 subsejji 
corresponding to cosets of 4Z 2 . Thus, cross constella- 
tions are well suited for QAM modems', and for coded 
modulation. 1 

4 'Reverse scaling' ' of the cross template of JFfe. 4jg 
suggests a constellation that gives us a^first chance to J 
lustrate an opportunistic secondary, channel .'FigV7(aj|js 
a template consisting ; of 8, "squares V f !where i thefour.^ 
"squares" are composed of two half-squares' that wrag 
(modulo 3) around the boundaries.' If, we place a conste 
lation point in the "middle" of each of these >[g! 
"squares," then four of the points lie. on the.boundapf 
and there are two possible positions where they 'could 
placed. If we leave this ambiguity unresolved, then 
arrive at the 12-point constellation, of 'Fig. 7( b )> 5V?W 
inner points and 4 pairs of boundary points . s • ^ ^ 

This constellation can be useiUo'sendj3 bits^per|g» 
dimensions, and in addition to support v an ppportunistic 
secondary channel with average rate of V/2 bit per 
dimensions. The average power is now i?(C ) 'l?5 f (»l 
dB), which is nearly 1 dB better than. the baseline avera 
power to send 0 = 3.5 bits per' two. dimensions 
p e (3.5) = 2 3 5 /6 = 1.89 (2.75 (dB)?^Also/Fig| 
has quadrilateral symmetry and other desirable 
istics. However, the fact that'' ^4' S ^9^^ ^M} 
probabilistic leads to systems' issues such as thos^I 
cussed in Section II-H. Subject to this qualificatio^ 
example shows that opportunistic secondary' channels 
yield high SNR efficiency: ' 'j^ ' 

B. Generalized Cross .Constellations ; t ^ \ \- 

We now begin to generalize the cross constellation s 
to higher dimensions where fractional normal izedibifja 



0 can be obtained, Since^ takest2f^point;W^|]^ 
sional constellations to represent ft, bitsf per .twp^un 
sions, it becomes , increasingly, {d^ 
coding simplicity, even at s6me;costan ? shape gaiii. 

Generalized cross constellationsfare built up/ from 
stituent 2D constellations ^i^^^P^^ff^PS^ 
a coset Z 2 '+ a of the, integer' lattice- Z 2 ;te.g.v'the 
integer grid Z 2 + ( l/2^r/2)^he"^^) ; fbld^p 
product C% /2 - is then a set'ofM-C 2 
, responding coset- (Z* (+\a)^€z|^flf«^ 
long as |C 2 | is greater than ^or {equalUp^p'baseliS 
2 fi 9 it will be possible to find,alsu^^ 
elements where 0 = ^fr/AJ'isithiBtWm^i^l^F^ 
principal virtues of generalized* crossyconstellatio 
that their constellation expansion rratio'%(> 2 T/?U;§ 
to 1, and that the subset 1 . C can be. defined; by 
coding rule. * ■ ' -' -"• *^yn«hjfej^|^tt^ :ffe 
(Generalized, cross constellations / can (also :bej>." 
construct constellations C with points 1 from any birr 
tice A. We start with a generalized cross constellati; 
that we. would use if A were:Z/VlSince 'any^transJS 
Z N is the union of | Z N / A \ ' cosetsV (translates): offiE 
will be a union of | Z N /A\ constellations: each cbjpjR 
of points from a single coset of A, each : with about' 
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...."lU.'.i.'-l'Ai 0 f?'.' "■ * 
(a) template»\i:«ti>'.ri »H *> **'''' 0>) constellation 
Template ana' •4i»ss < coh , S teiiation" for sending b « 3 b to perty 
asions, plus a ,i/2bi,.(per "2D) opportumst.c secondary channel, 
,irs of boundary,points (shaded)^ j ( . . . • 

ly) I C I /| Zff/A'.|3p5ihts*5?i and we.may take any ot 
as C. Note that thVeffective bit rate 0 is reduced by 
^rmalizediredundancy/pCA)^ (2/N) log 2 1 Z /A |. 
we.shaU'consider/ohlyVthe.case A = Z .) 
'send a half-integeri number, of.bits per two dimen- 
say 0 =,n ,+ ,l/2:with it an integer, we may use a 
dimensional. 2 2 ^i-point) constellation.! One way of 
- this is to generalize'cross constellations as follows, 
ao build up the.4D constellation from constituent 2D 
iellations.c(This idea appears both in H] and in [2].) 
, two dimensions, jcdhstruct a 3 r. 2"Z '-point constel- 
n C 2 with 2",r inner points";and 2",,'. "outer points, 
four-dimensional constellation C then consists of the 
tesian product C\of this' constellation with itself, ex^ 
i that pairs of outer, points are not allowed. Then there 
■> 2 " 4D points of the type! (inner,, inner), 2" of the 
i (inner, outer);, and 2^.7' of the type (outer, inner), 
i:n+ ,' ( 4D points' ih,all. The average power P(C) (per 
,dimensions)'is thus tf/* the average power of the in- 
. points plus 1 /4 the average power of the outer points. 
he2 ? > + " point 2D cross, constellation can be seen to be 
ilt up from constituent 3 r^.! -point ID constellations 
exactly this way^'uVJ-'f'. '- •-■ •'• ■ '•' 
Two specific scalable templates for appropriate 2D con- 
llations are illustrated in Fig.' 8(a) and (b):Tn Fig. 8(a), 
e inner points are the 2'!>p6ints in a 2D cross constella- 
m and the . outer points • are an additional 2" points 
omthe samejgrid.in the shaded outer region. The aver- 
se powerof the- inner: region is then-31/6; the average 
'lergyiof the outerregion can be computed to be 41/3; 
letefore, to the! accuracy of the continuous approxima- 
,on-;the average power . of the 4D region -R that defines 
he generalized cross constellation, per two dimensions, 
$ P(RV ='' 175/24. The volume' of this region is V{B) 
.2'iVio the shape-gain y, (R) = .V{Vi) l/2 /[6P{n)} is 
!^/il75 :(0: 15 idB), scarcely more than that of the 2D 
cn&TsTwo' suchi constellations are shown in [1, Fig. 8]; 
4fofid(C-) •"=- lr they actually, have; average power 
P(Cj;=>7tl25 (8!53 dB) and 29 (14.62 dB) for p - 5.5 
■fri bits.per two dimensions, respectively (compare 
fl^appioximatiohs'P(B), = 175/24.(8.63 dB) and 
175/6 (i4;65. dB ); respectively ). Similarly, the template 
feg; 8(b) . defines a square inner region with average 
pfe-32/3 arid ah outer region with average power 
ifewhich generates a 4D region with P(R) = 173/12 
tn*) = 2' 3 , so 7j (R) = 2 7 V 173(0 



(a) 



(b) 




Fie 8 Templates for inner and outer points of constituent . 2D eoBttUa- 
F l : f or construction of 4D generalized cross constellates: (a) with 
inner Snts fo^ tag a cross constellation, (b) with inner points fornnng 
a squa're "onStion, (c) circular constellation, with inner po.nts form- 
ing a circular constellation. 



i.20 dB), a small 



improvement over Fig. 8(a). The corresponding constel- 
lation for fi. = 6.5 actually has P(C) -14.25 .54 
dB) [compare the approximation P(R) = 173/12 (U.w 

dB If we wish to construct large 4D constellations, and do 
not care how complex the encoding of the constituent 2D 
constellation becomes, then the inner and outer points of 
the 2D constellation should be approximately delimited 
by circles of radius P 0 and P , , with R\ = ( 3 /2 ) R 0 so that 
the area of the inner circle is twice that of the surrounding 
annulus, as shown in Fig. 8(c). In this case, K(H) - 
2(ttRI) 2 , and P(R) can be calculated to be (11/16)P 0 » 
L that the shape gain is 7 ,(R> = K(H)^/6P(H) = 
ir2 7/2 /33 = 1.08 (0.32 dB). This is halfway between 
what is achieved with the templates of Fig. 8(a) or (b) and 
the 4-sphere limit of 0.46 dB. 

As with cross constellations, these generalized cross 
constellations are very suitable for use with coded QAM 
modems. They are relatively simple to implement, since 
the mapping complexity is approximately that of a 2D 
constellation, with a very simple "inner/outer" coding 
rule to create the 4D constellation from its 2D constitu- 
ents. The size of the constituent 2D constellation is 3 • 
2"~\ compared to the minimum possible size of 2 
so the shaping constellation expansion ratio CER^C) is 
only 3 /2 3/2 = 1 06. The peak-to-average power ratio is 
also well bounded; to the accuracy of the continuous ap- 
proximation, it is 480/175 = 2.74 for Fig. 8(a), 492/ 73 
= 2.84 for Fig. 8(b), and 24/11 = 2.18 for Fig. 8(c). 

The inner and outer regions of the three 2D templates 
of Fig. (8) all have four-way phase symmetry, so if points 
are chosen from the half-integer grid Z + ( 1 /2, 1 /2), 
the constituent 2D constellation will necessarily have 4- 
way symmetry. For use with a coset code of depth ji, the 
2D points in each of these regions must be evenly divided 
among the 2" cosets of P"Z 2 whose union is the half-in- 
teger grid. If ii is 2 (or less), then this requirement is au- 
tomatically satisfied. For p > 2; minor modifications of 
the templates of Fig. 8 may be needed to satisfy this re- 
quirement. (See [2] for examples.) 

We now give an example to show that an opportunistic 
secondary channel with a small fractional average bit rate 
may be added to generalized cross constellations, in such 
a way that all design constraints are respected. 

Consider the 48-point constituent 2D constellation par- 
titioned into 4 equal subsets that is shown in Fig. 9(a). Its 
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Fig. 9. Constituent 2D constellations for 4D generalized cross constella- 
tions with normalized bit rate /3 = 5.5 bits per two dimensions, evenly 
divided into 4 subsets, with inner points bold; (a) optimum 48-point con- 
stellation, (b) 52-point constellation with 4 paired outer points, to sup- 
port an opportunistic secondary channel with average bit rate 1/16 
bit/Hz. 



inner points are the 32-point cross constellation of Fig. 
3(a) [the 32 least-energy points from the half-integer grid], 
and its outer points have been chosen as the next 16 least- 
energy points, respecting quadrilateral symmetry. [This 
improves SNR efficiency very slightly over the 48-point 
constellation that would be obtained by using the template 
of Fig. 8(a)]. 

Fig. 9(b) is. a more symmetrical 52-point constellation, 
which adds the next 4 least-energy points. We may take 
4 pairs of outer points, each pair consisting of two points 
from the same subset, such as the pairs shown shaded in 
the figure, to create an opportunistic secondary channel 
whose average rate is (1/4) x (4/16) = 1/16 bit per 
two dimensions (e.g., a bit rate of 2400/16 = 150 bits/s 
if the baud rate is 2400) where 1/4 is the probability of 
an outer point and 4 / 16 is the fraction of outer points that 
are paired. The pairs are chosen to maintain 4-way sym- 
metry. Since the piaired points have the same energy, the 
average power using Fig. 9(b) is the same as that of Fig. 
9(a), namely, P{C) = 7 (8.45 dB). (The baseline aver- 
age power is 2 5,5 /6 = 7.54 (8.78 dB), so the shape gain 
is 1.08 (0.32 dB), as predicted for circular 4D general- 
ized cross constellations. ) Thus, in this example a low- 
rate opportunistic secondary channel costs nothing in SNR 
efficiency. 

C. Generalization to Arbitrary Binary Fractions 

We now give a further generalization that permits send- 
ing any number 0 = n + d/2 m of bits per two dimen- 
sions, where n is an integer that is "large enough" (to be 
defined later) and d is any integer in the range 0 < d < 
2 m , using two-dimensional constituent constellations. This 
requires the construction of an N dimensional constella- 
tion with 2 n(N/2)+d points where N = 2 m+1 . A construc- 
tion of this type for fractions with d = 1 has been pub- 
lished in [2]/ 

We first express the number d in the standard binary 
representation as d = d 0 + 2d v + • • • + 2 m ~ 1 d m _i, e.g., 
if d = 5, then d = 1 + 4. Let J (d ) be the set of j for 
which dj is nonzero: J (d ) = { j: d } = 1 } . Each j e J(d ) 
will correspond to a two-way partition of the constituent 



2D constellation C 2 . We may take these partitions in any 
order, but let us take they e J(d ) in ascending order. If 
the smallest j eJ(d) is /, then the ! first partition divides 
C 2 into two subsets whose sizes 1 are ; in the rati 
l:2^" J \ called "/-inner" and ?'/ -outer." Each ad- 
ditional j e j(d) further partitions each of these subsets 
into two smaller subsets in the ratio 1 :2~ {m Ti\ Conses 
quently , we finally arrive at a partition of C 2 > into 2 suK 
sets where K — \J(d ).\ is the, number of, nonzerQ Cpeffi* 
cients in the binary representation of d, ^consisting of fall 
possible combinations of /inner and /outer points fori? 
|7(rf)| different partitions. The ratio of the size of the 
largest subset to that of the smallest subset is the produj 
of the constituent ratios. ' ,; ' ' ; ".', T 

As in Section V-B, the points of each subset should^ 
chosen from the half-integer grid in such a way ^ as 
maintain four-way symmetry, and, for use with a code 
should be evenly divisible into 2? cosets'of R^i^h 
\k is the depth of the code ©; which may represent ap 
ditional constraint when fi > .2^\$W:$tyM 

The size of the largest subset is fixed asf 2 n /points? SL 
the smallest subset must haye/at least max [4; 2J 4 ] por 
to meet the two requirements of the previous' paragra 
this determines how large the integer ;n !must + beM v 

For example, let 2 m be 8,' and let derange from !l]# 
For d =. 4, the sizes of the two; subsets, of j C 2 '?arej2" 
2 rt_1 , and this method will reduce; to the generalized^ 
constellation construction for halfrinteger numbers||5| 
bits per two dimensions given in the previous section. 
d = 1 or 2, there are. two subsets with^sizes 2" arid{25 
or 2 n and 2"~ 2 , respectively j For rfr=i 3; 5, or 6^ 2 
vides into four subsets, with sizes 2"; 2"\~ 2 , 2"ji*anc| 
for J = 3, for example. The ;wqrst ,case is ' dl^T^ 
requires eight subsets with sizes. 2*;t25r?K^V?i! 
2"~ 4 , 2"~ 5 , and 2 n ~ 6 . Thus, in this last ca^eithe(i.nteg 
must be at least 8 (if ft <» 2 ).:jl§^^ft*^^a^' 

Fig. 10 is an example of a^QonsteHatipnl that can^g' 
to send 7 3/ 8 bits pertwo: dimension 
consists of four subsets of sizes'. ^8^;?a6,^nd!J^ 
The points in the subsets have be^vsd^t^in afl^ 
manner— i.e.,; as; the^owest^eirergy^ 
lected . However,* < quadrilateral ^sxmmetryi -has^ee 
spected; since the; constellatipiy^fid^ 
integer grid, this means r that ■ ^itowill|auton^icall| 
evenly into 4 cosets.of 2Z 2 i!(Oft,c^ 
not be divided; evenly* inta^:^ 
smallest subset has only .4 Roints^^^^4^ 

The shaping constelliation,expansi v q^^ 
| C 2 | /2 0 is very close to the minimum possible.^IJor 
4, CERjX C ) = (3 v 2^)/(2?^)^3/2^ 
as already noted. For d = !l,r2,>and?3^itasi l!0^ 
and 1.08. The worst case is d; = 'i7pwhere it;is*l|lS. 

The following code maps 2 rn n ' , ,+'fd bits into 
of 2 m points from the constituent:2D + consteUatiQn 
fined. For each j e J(d ), we divide the sequenc^' 
subsequences, each of length 2^1 j :~FoT-p^ci 
quence, we let one "designator bit^detenninefyF 
any element of that subsequence will be /outer.(If so 
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TABLE II 

Average Power to Send 0 Bits per Two Dimensions 



© O 0;0 O O.OjO o o o © 

, © p o o o o o.o o o o o © 
5 ©' o'XJ o'boo'ooooooe 
'•'©'o'b.dlo'o^bo oooo© 
© © o l plb\b^o il d l 6^o , .o l *o*o'® • \ 
, , • © do b^'bfbvb^b o o o o o © © 
k © obVo^o o o o o o o o © 
"® d'dyb'bWo o o o o ©. 
©'d'b'b^tf'd'o' ood o.o © © 

10jl8Oi^ihf»i^ii{% Sll&oVfcr sending 7 3/8 bits per 
^dimensions, divided 'into 'four subsets of .128, 32, 16, and 4 points. 

■ *iV** I . ii ioiyt»'i Ci hh& jih^«6v»t ^iJt V*V 3ftj.> > 
a further ■m.^/j bits ^determine which.one of the 2 m 7 
ments;wiil ! b^/^ will need m 

i fewer bits ( tc^ a pinner point, we 

11 spare exactly <the necessary number} of bits. ) When 
is procesVh^ of the 2 W ele- 

cts of the sequence ^ will liave been designated as>inner 
;-outer:for each^of ^the A J(d ) |» values of ./.The 
.mbef jSf/deiigna^ e J(d)\ 

;ir total, by the definition of d| f ' ; *is if. If all elements are 
nner for all j e J(d), then it takes n bits per element, 
2 m n bits in all, to ( specify . which of the 2" points in the 
rgest part of C 2 is to be used, so the total number of bits 
tA\&\2?jC^! t di^OT each element;; for each j e J(d ) for 
hich tHe eiehient : is'i/-bufcr; ? m;:-;y fewer bits have to be 
,ed to selectja pdirit;. so regardless of the values of the 
jsignator bits,- the.total^humber of bits uised is 2 m n + d. 
his code is simple; to implement (if riot to describe). 
With this code, the probability thai any given 2D point 
j-outer i^ e J(d). For instance, 

, r j = m ^he f probability that a 2D point is j-buter 
, i /4, as detennined/ea of selecting 

point from* a particular one of the 2* subsets is the prod- 
ct of K constituent prbb'abilities; where each constituent 
robab'ility f is '"a probability, of. being J-inner or y'-outer, ac- 
ording to'the composition of the subset. The probability 
lat any particular point in 'a subset is selected is the prob- 
bilitjf of thai subset (as'determiried above) divided by the 
umber of points in that subset. The average power per 
vo dimensions is' the' product of 2 K terms 1 , one for each 
tibset, each term being the product of the probability of 
iat subset with the average energy of points in that sub- 
ct^To 1 minimize' the average power, the constellation 
hould be partitioned, so that the points of larger energy 
ire Assigned so as to have lower probability. 

For* example, the probabilities of the four subsets of 
■oints/in Fig. 10 are ( 15/16) X (7/8) for the 128 "in- 
ier-iriner" points, (15/16) X (1/8) for the 32 "inner- 
iuterV points, (1/16") x (7/8) for the 16 "outer-inner" 
joints,! and (1/16) x ( 1/8) for the 4 4 ' outer-outer' ' 
,x)ints. Since the average powers of these four subsets are 



a icy 

7 128 

7 1/8 144= 128+ 16 

7 1/4 160- 128 + 32 



7 3/8 180=128 + 32+ 16 + 4 1.08 



CEBs EXQ JtdBl ElEi IdBJL 

1.00 20.44 13.10 20.37 13.09 

1.03 21.85 13.39 21.81 13.39 

1.05 23.57 13.72 23.56 13.72 

25.49 14.06 25.45 14.06 



■him EAB 
0.20 2.00 



71/2 192 = 128 + 64 



1.06 28.05 14.48 28.01 14.47 



0.28 
0.32 
0.36 
0.32 



2.10 
2.16 
2.25 
2.18 



7 5/8 216= 128 + 64+ 16 + 8 l!o9 30.30 14.81 30.30 14.81 0.36 2.27 

73/4 240= 128 + 64 + 32 + 16 1.11 33.07 15.19 33.03 15.19 0.36 2.31 

77/8 270= 128 + 64 + 32 + 16 1.15 35.96 15.56 35.93 15.55 0.37 2.39 
+ 16 + 8 + 4 + 2 

8 256 1-00 40.69 16.09 40.74 16.10 0.20 2.00 



20.44, 45.5, 54, and 56.5, respectively, the average 
power of the constellation is P(C) = 25.49 (14.06 dB). 

Table II gives values of the smallest average power 
P(C) that can be obtained by this construction for num- 
bers |3 of bits per two dimensions equal to 0 = 7 + d/% 
where d ranges from 0 to 7. The sets have been chosen as 
in Fig. 10, using the least-energy points available from 
the grid Z 2 + ( 1 /2, 1 /2). Quadrilateral symmetry is at- 
tainable except for the case 0 = 7 7/8. We also give for 
comparison an asymptotic average power P(U) 9 using 
sets bounded by circles of increasing radius as in Fig. 8(c), 
and the corresponding shape gain y s (B). We see that the 
approximation is already accurate to within 0.01 dB. Fi- 
nally, we give the shaping constellation expansion ratio 
CER 5 (R) and the asymptotic peak-to-average power ratio 
PAR(^). (The asymptotic values of CER,(fi); y s (R), 
and PAR(B) given in Table II are valid for j8 = n + d/% 
for any integer n, not just n = 7. ) 

It seems that finer constellation partitioning leads to 
slightly greater shape gains, up to approximately 0.37 dB, 
but the difference beyond the half-integral method (0.32 
dB) is small. The two-way (inner/outer) partitions used 
for jS = 7 1 /4 and 7 1 /8 yield about the same or slightly 
less gain. The more composite the constellation, the 
higher the CER 5 and therefore PAR [by the relation (*)], 
but in general it is a virtue of this construction that the 
CERj and PAR for the constituent 2D constellations are 
close to their minimum possible values, as noted in Wei 
[2] where other examples of such constellations satisfying 
various constraints are also given. The mapping complex- 
ity is also roughly that of a 2D constellation. 

These methods can be generalized to construct /V di- 
mensional constellations with 2 nM ' d points from constit- 
uent partitioned (N/M) dimensional constellations. From 
the examples investigated so far, it appears that in general 
the resulting shape gains are limited to within about 0.1 
to 0.2 dB more than the shape gains obtainable with 
(N/M) dimensional constellations, as we have already 
seen with N/M = 2. 

VI. Conclusion 

The principal purposes of this paper have been to pro- 
vide an overview of multidimensional constellations, with 
a full discussion of characteristics that are desirable in 
practice; to develop figures of merit, principally the shape 
gain and shaping constellation expansion ratio, that de- 
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pend only on the shape of the constellation in N space, 
and that are more or less decoupled from the gain and 
expansion factors due to coding; to develop bounds on 
achievable shape gain as a function of dimension and of 
constellation expansion; and finally to present a class of 
constellations, namely generalized cross constellations, 
that are in fact good for practical applications. 

Generalized cross constellations have most or all of the 
characteristics desired. They support fractional normal- 
ized bit rates with very small shaping constellation ex- 
pansion ratios, usually less than 1.1, and with moderate 
shape gains, less than 0.4 dB. However, the bound plot- 
ted in Fig. 5 shows that in fact a shape gain of as much 
as 1.0 dB is achievable in principle with a shaping con- 
stellation expansion ratio of only 1.1. Thus there is room 
for considerable improvement in shape gain, perhaps with 
some sacrifice of other desirable constellation character- 
istics. The Voronoi constellations to be discussed in [Part 
III can achieve considerably greater shape gains, but gen- 
erally at the cost of greater constellation expansion and 
implementation complexity. 
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A Coherent Digital Amplitude and 
Phase Modulation Scheme* 

C. N. CAMPOPIANOf, member, ike, and B. G. GLAZERf, member, ire 



Summary— In this paper, we present a technique for a combination 
of amplitude and phase modulation for digital communication. This 
technique assumes phase coherence between transmitter and 
receiver. For purposes of analysis, we also assume baud synchroni- 
zation. The technique involves the simultaneous and independent 
amplitude modulation of two carriers of the same frequency which 
are in quadrature with each other. The demodulation process 
consists of the separation of the in-phase and quadrature components 
followed by amplitude detection. This scheme appears to be easier 
to implement than either the system studied by C. R. Cahn or the 
system investigated by J. C. Hancock and R. W. Lucky. Further- 
more this system is comparable in reliability to the latter scheme. 

I. Introduction 

THIS PAPER is concerned with digital communica- 
tion over channels perturbed by additive, narrow- 
band, zero-mean Gaussian noise. For such channels, 
binary communication schemes make inefficient use of 
the channel capacity when the signal to noise ratio is high. 
For this reason, there is today considerable interest in 
nonbinary communication techniques. The results of 
Shannon [1] and of Blachman [2] suggest that a combina- 
tion of amplitude modulation (AM) and phase modulation 
(PM) is especially to be considered in the design of systems 
for such channels when the SNR is high. 

Combinations of digital amplitude and phase modula- 
tion (AM-PM) have recently been studied by Cahn [31 and 
by Hancock and Lucky [4]. Cahn has examined digital 
phase modulation alone [5] as well as AM-PM. Following 
the terminology of Hancock and Lucky we shall call the 
AM-PM system studied by Cahn the Type I system. 
In the Type I system, the same number of phase positions 
is used with each amplitude level. The possibility of im- 
proving efficiency by using more phase positions with the 
larger amplitude levels was suggested by Taber (see [3], 
p. 151). Hancock and Lucky [4] investigated the latter 
possibility in detail. For. fixed average signal power they 
determined that number of phase positions for each 
amplitude level which would yield a minimum probability 
of error. A similar optimization process was carried out 
for fixed peak power. We shall use the terminology of 
Hancock and Lucky in referring to the AM-PM scheme 
which they investigated as the Type II system. 

In the present paper we present an AM-PM system 
which we call the Type III system. The Type III system 
is comparable in reliability to the Type II system. In 
addition, it may be fairly readily implemented. In fact, 
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although the initial motivation for the Type III. sysfo 
was the problem of modulation and demodulatio nk dtl 
large signaling alphabets, it appears that it may bejrecon 
mended even for signaling alphabets of < moderate^a 
The Type III system amounts essentially to ■ thejamgtj 
tude modulation and demodulation of two carriers! 
have the same frequency but are in quadrature withleael 
other. The simplicity of mechanization of this|sche 
lies in its reliance upon amplitude modulation ;and| 
modulation. It must be emphasized, however, yiia|~ 
Type III system requires phase coherence whenS9 
Type I and Type II systems can function in a different: 
coherent mode, . 

In the present paper, we describe the Type H|jj 
give analytical expressions for the probability ofkerjTjffl 
this system, and compare the Type ( III- systen^OT^ 
systems of Type I and Type II. It .is/conyenien^ 
following exposition, to begin by, describing^" 
modulation and demodulation process -that includS 
special cases, the three systems considered in v ^ffi^ 
This is done in Section II. In Section III , the^repl 
Section II are specialized to systems of Type|Il 
III. Finally, in Section IV we briefly, compare^g 
systems. ■• . .. \ <: ./ 



II. 



♦••iff 



Amplitude and Phase Modulation! 
and Demodulation <;>; 

In the present section, we list, our/. basic; assii 
and discuss a general 'modulation-demodulatior 
that includes as special cases all others, treatgey 
paper. ; ■ , ] 

A. The Signaling Alphabet and the Channeti:i-& 

We suppose that there is phase coherence, betwg 
transmitter and the receiver. It is further ''assume! 
the receiver knows both the time of arrival off 
and the duration of the signal. The transmitterji 
disposal M signals for transmission. These^ag 
each be of finite duration, T seconds. We canMiiS 
set of M signals as . 

$i(t) - A,- cos (co/ — 0 t ) • 

= d cos <*)t + Si sin at M 

where 0 < t < T and i = 1, 2, - - : , M. TtoffiEfl 
{Si(t) f • " , s M (t)\ is called the signaling alph 
called the size of the alphabet. We use the fai 
representation of the signals Si(t) as indica tedlSI 
We call the plane of Fig. 1, the phasor plane.. 
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i (pUADRATURE* 
AXIS) 




i sj(t)--Ci COS wt + Si SIN wt 



C| (IN-PHASE AXIS) 

Fig. l— Phasor representation of the signaling alphabet. 

We assume that the channel is characterized by addi- 
;e,Tzero-mean,. e narrow-band Gaussian noise of mean 
tmN. The noise' waveform, n(0, is represented as 

n(t) = x(t) cos o)t + y(t) sin <at 

here x(t) and y(t) are,- independent samples from an 
tsemble of 'functions such that the ensemble averages 
^)p(z 2 (t)) and (y 2 (t)) are equal to N while (x(t)) t 
( (tyj$(n(t)} are'* zero. It is assumed that x{t) and y(t) 
re slowly varying iri M the sense that they do not change 
ppreciably in an. interval of T seconds. Ignoring propa- 
itiortjtime, we may write the received waveform, r(0, as 

%' r(t) = Si (i) + n(t) (fl<t<T). (2) 

;. The Detection Scheme 

Suppose that one of the signals in (1), say is 
ransmitted and that r(t) in (2) is received. The received 
.aveform is then processed as indicated in Fig. 2. The 
rsult of this process is the decision vector (u, v) with 
umponents 



u = d + 



v = S< + 



f fyV)dt. 



(3) 



The decision scheme is conveniently described in terms 
J the phasor plane. Consider the vector of (3) as a point 
,n the phasor plane. To each signal point (C7 f S { ) we 
iS&ciate-a set of points, say D if called the detection 
Won for Si(t). The vector (w, v) is decoded as *,(*), if 
and only if, it lies in D<. D l} D 2 , • • • 7 D M are pairwise 
disjoint, but their union may not exhaust the phasor plane. 
If (w, v) lies in none of the sets D i} then no decision is made. 

In the system of Type III the detection regions will be 
■imple rectangular regions with edges parallel to the axes 
A the phasor plane so that a decision can be made directly 
in terms of thresholds on u and v. In the systems of Type I 
ind II, however, the detection regions arc expressible in 
the form H, < R < R 2 , 0, < 0 < 0* where 



-£-cosut 



LOW - PASS 


T 


FILTER 






L 






LOW- PASS 




FILTER 





DECISION 
OEVICE 



-♦OUTPUT 



Fig. 2— Quadrature detection for AM-PM waveforms. 

Hence the decision device for the latter systems will have 
to include or be preceded by computing elements that 
yield R and 0. This explains the complexity of the imple- 
mentation of systems of Types I and II compared to the 
relative simplicity of implementation of the Type III 
system. 

C. Probability of Error 

We suppose that each of the M signals in (1) has prob- 
ability l/M of being selected for transmission inde- 
pendently of past transmissions. For i = 1, 2, • • • , M 9 
Pi(e) denotes the conditional probability of error when 
$i(t) is sent; i.e. } 

p.(e) = Prob {(w, v) is not in D { \ s t (/) is sent}. 

The (average) probability of error, P(e), is defined as 

If P is any of the above probabilities, then E, P and P 
will represent respectively a lower bound, an upper bound, 
and an approximation to P. 

The quantities u and v in (3) are samples of random 
variables, say U and V respectively. From our assumptions 
it is readily seen that U and V are independent, normally 
distributed random variables with identical variance, N, 
and with means C, and S t -, respectively, where (C tf Si) 
represents the transmitted signal. Then p f (w, v), the 
conditional probability density function of U, V is 



r = V?T7, 



- tan" 1 [f c 



P.-(*> V ^2^N 6XP 



{- 



(u - C,) 2 + (v - S { 



2N 



4 



(5) 



(4) 



Probabilities computed from (5) will ordinarily involve 
A i} Ci and S { . However, we shall relate probabilities to 
average signal power, S, where 

e _ JL fill - ± f ^ + ff) 
M hi 2 M h 2 

In fact, in Section TTT, it is shown that for each of three 
separate AM-PM systems, the probability of error, 
7 >(<) (e), is given approximately as 

P"\e) » K { exp (-7?, |) = exp (-7?, | + In Jv.) (6) 
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where 2f< varies slowly with S/iV relative to exp For this case, Cahn has shown that 
(— Ei [S/N]) and both K { and E { depend upon the par- 
ticular system discussed. E iy on the other hand, is inde- 
pendent of S and AT. In order to compare two systems, 
say system I and system 2, it is not uncommon to de- 
termine the ratio 



p(e) 2 1 



[• - *WI -;-)] 



cxp [—(S/N) sin" ir/n] 



r = 



(S/N), 
(S/N). 



(7) 



hrS 



sin 2 w/n 



where 



where (S/N) { (i = 1, 2) is that SNR for which system i 
has probability of error p } where p is the same value for 
both systems. If (6) holds, and if K t is a slowly varying 

function of (8/N)u then for sufficiently small p or suffi- For the case n > 2 > one can > b ^ a shght^ extensioj 

ciently large (S/N) t . the ratio r in (7) is given approxi- Cahn ' s anal y sis > demonstrate that the second term rip 

mately as is in fact an upper bound for P(e). For practical cases]i 

even a tight upper bound. In fact, following Cahn, wel 



Therefore, the quantity E { in (6) provides a convenient 
measure for comparing systems for which (6) holds. 
From (6), we have 



where 



Ei = Urn 

<S/JV)-a> 



f -lnP'%) "] 
L S/N 1 



(8) 



We shall call E { in (6) and (8) the detection reliability of 
system i. This concept is somewhat similar to the concept 
of " reliability' 1 utilized by Peterson [6] in the theory of 
error-correcting codes. 



P( e ) = 1 - [ p(6) dd 

J -w/n 



Furthermore, for x > 0, it is known [7] that' 



1 - 



V2 



7T X 



III. Specific AM-PM Systems , . , . . 

Combining the above, we conclude that whenj. 

In this section, the definitions and results of Section II £( e ) < < p( 6 ) where 

are apphed to three distinct AM-PM systems. 



A. The Type I System 

This is the system studied by Cahn [3]. In this system, 
the set of M signals is 



P(e) 



8 k( Q) = A t cos (wt - 2{i - , 0 < I < T; 

A,>0 



E(e) = P(e) - 



n 

2*. S/N '. 



(9) 



A„ = A, + 2{k - \)A, sin - , 

n 



Even for moderate values of S/N, P(e) andJE 
tight. For example, when S/N == 10, P(e) ^ 
(0.012) JP(e). N . 
Case 2: m > 1. Here D ki is defined as the regi(in[j 
where k = 1, 2, ••• , m and Af = run. The optimum phasor plane whose polar coordinates satisfy thea 
values of m and n for fixed peak signal power or fixed 
average signal power are given in Tables I and II, re- 
spectively, of [3] for the case in which M is a power of 2. 
Two distinct cases arise, namely, m = 1 and m > 1, 
which we shall consider separately. 

Case 1: m = 1. Let the polar coordinates of points in 
the phasor plane be 0). Then the detection region 

D u of signal *„(<) is defined as the set points whose polar fo;' example) the detection region Du indicated £ 

By integrating t>) over the circle, the squwj^ 
shaded half plane in Fig. 3, we obtain, respective^ 



\R - A k |'< A, sin- 

1 J. n 



0 - 



2ir(i - 1) 



n 



<=• 

n 



A variety of approximations are obtainable here; < 



coordinates satisfy the condition 
2(t - 1W 



0 - 



<=• 
n 



distinct approximations to P(e), namely 



.Mil 



ADRATURE 
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B. The Type II System 

The Type II system was investigated by Hancock and 
Lucky [4]. The signaling alphabet for this system may be 
represented as follows: 




(IN- PHASE 
AXIS) 



/ (i - l)27r\ 
s ki (t) - A k cos [cot ) > 



0 < t<T\ 



i =1,2, 



* Fig. 3— The detection region Dm. 

i - * ■ . *' 

* v " » 

* * ' •' 



1 + 2(fc - Dsin^] , 

fc = 1,2, 



, m; 



(ii) 



2$ 



Ai sin - 
n 



( 2exp ( k -^sm -j 



n, = 3 

n 2 = ni + 6 

n m _i = n m _ 2 + 6 



= M — £n». 



sin - 

n 



In (11), m, the number of amplitude levels, is the unique 
positive integer such that 

M - 9 < "f[3 + 6(t - 1)] < M " 1, 



session P(e) has ' . been given by Cahn in [3]. 
•suits of Cases 1 and 2 can be summarized in the 
rform: \ " { \. 

ne) ~ V^ES/N 

2em(-E S/N) 



or equivalently, 

M - 9 



<(m- If < 



M - 1 



VvE S/N 



(m = 1) 
(m > 1) 



T, A\ . 2 IT 

B = 2S Sm n , 

fables I and II of [3] .the reliability E has been 
cd and'recorded in Table I. 

TABLE I 

Reliability of Systems of Type I, II and III 
Detection Reliability, E 



n k is the number of phase positions on the fcth amplitude 
level. We have computed n k according to the prescription 
of Hancock and Lucky for values of M = 8, 16, 32, 64, 
128, and 256. The results are summarized in Table II. 
We'recali that for the Type II system the parameters w, 
and m were selected so that the probability of a phase 
error and the probability of an amplitude error are equal 
and constant for all amplitude levels and so that a mini- 
mum value of average probability of error is attained. 

TABLE II 

Amplitude Levels and Phase Positions for the Type II 
System 





Type I System 


Jiet 


Minimum 


Minimum 


u 


Peak 


Average 




Power 


Power 


l 


0.50 


0.50 


s 


0.14645 


0.14645 


f) 


0.07116 


0.07116 




0.02595 


0.02740 


a ;;!■■: 


0.01407 


0.01407 


0.00557 


0.00595 




0.00315 


0.00315 



Type II 
System 



0.148802 
0.080056 
0.042776 
0.022008 
0.011231 
0.005712 



Type III 
System 



0.500000 
0.181818 
0,100000 
0.050000 
0.023810 
0.012195 
0.005882 



Amplitude 
Level, k 


Number of phase positions, n*, on the / 
amplitude level. 




M =8 


M = 16 


M = 32 


M = 64 


M = 128 


M 


-256 


1 

2 
3 
4 
5 
6 
7 
8 
9 

10 


3 
5 


3 
<) 
1 


3 
9 
15 

r> 


3 
9 
15 
21 
16 


3 
9 
15 
21 
27 
33 
20 


3 
9 
15 
21 
27 
33 
39 
45 
51 
13 



IRE TRANSACTIONS ON COMMUNICATIONS SYSTEMS 



94 

. The detection region, D ki , for signal s ki (t) is taken to 
be the set of all points in the phasor plane whose polar 
codrdinates satisfy the condition 



\R — A k 



7T 

< Ai sin — 



9 _ (* - m* 



Then, as in Section III-A, we obtain P(e) « ^(e) where 
2exp^-^sm -) 



He) = 



I T2 

/7rAi . 2 

\ 2N n x 



nj _ 2 exp {-E S/N) 
VirE S/N 



From Table II we have computed the appropriate values 
of the detection reliability E and have recorded them in 
Table I. 

C. The Type III System 

The signaling set for the Type III system is given by 
(1) with 

d = CiA, Si = St A 

where A is a fixed positive number while c< and s< are 
appropriately fixed integers. In this paper we consider 
only the cases M = 4, 8, 16, 32, 64, 128 and 256. The 
signal points for these cases are illustrated in Fig. 4. In 
the case M = 4 it is understood that each of the four 
quadrants is the detection region for the signal point 
contained in it. In all other cases, we stipulate that the 
detection region for the signal point (c<A, s<A) will be 
the open square, bounded by the lines x = c»A d= A, 
y = SiA ±: A where x and y correspond to abscissa and 
ordinate, respectively, in the phasor plane. We discuss 
P(e) in two separate cases, namely, M = 4 and M > 4. 

Case 1: M = 4. Let the signal s x (t) correspond to 
C x = Si = A in (1). Then D u the detection region for 
is the region x > 0, y > 0 in the phasor plane. 

Therefore, 



Pi(e) - 1 



Hence, by symmetry, 



(r-yl) 2 + (?/- A)' \ 
2N ) 



dx dy. 



exp (-(1/2) S/N) 
V(t/2) S/tf 



where <S = A 2 . 

Case 2: M = 8, 16, 32, 64, 128, 256. In these cases we 
readily obtain 

P ® - 1 " [ 2 *(^) " 'I' 
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Fig. 4— Signal points and detection regions for the- 
system. 1 ' . ■ r ^ 

. .. -..^f 

From Fig. 4 we can express P(e) in jbermslof 
detection reliability, E, as follows; ' ' 

Values of the detection reliabihty are given 
Note that no serious attempt wa<s r ;made^ 
the signal points in an optimum f way; to mi 
peak power or average power. Slight modi fiMtS T 
distributions shown in Fig/ 4 were' aittemDtedi 
only to minor changes in detection- rehabil^Y- 

IV. Comparison of Type I, II and IIIiSxE 

A very convenient and common^way^iOl 
modulation and demodulation systems is to^com 
SNR in decibels which the systems require 
satisfy a specified probability of error J^gs 
comparison is readily accomplished in .thepr" 
by giving a table of values of the detection 1 
E } in decibels, i.e., 10 log J0 E. Table III co^ 
of 10 log 10 E for the three systems discussedlpK 
system enjoys an advantage of from 0 to 3.5 
SNR relative to the Type I system. On theTSK 
the Type II and Type III systems differ' by 
decibel. 
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; TABLE III 
1 .tectk3n Reliability (in Decibels) of AM-PM Systems 



A SIN ut 



1 



Type I 
(Minimum 
rv Peak •■: 
Power) 



Type I 
(Mini- 
mum 
Aver- 
: age 
Power) 



X -3.0 

r* -8.3 

V -11.5 
? -15.9 ' ' 

», -i8.5 , 

&-22.5 ♦ / 
^-25.0,V. 

^i'7-10 logioM 



-3.0 

-8.3 

-11.5 

-15.6 
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Fig. 5— Type III transmitter. 
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even for M - 256, the Type III system requires only two 
4-bit quantizers. It is concluded, therefore, that among 
the phase coherent systems considered, the Type III 
uu^. m, WB u« ^ — , . » - offers by a small margin the best performance and by a 

Pp&to a linear pbwer^amplifier. The transmitters su b s tantial margin the simplest mechanization, 
^ll' orlTypeJI while roughly the same in block 
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Abstract 

Discrete multitone (DMT) has been considered for various 
wireline, broadband communication systems. However, it is 
well-known that this modulation scheme exhibits high peak- 
to-average power ratio (PAR). We introduce a novel approach 
to reduce the peak power using constellation shaping. We 
propose a method based on a hyper-spherical boundary, to 
shape ordinary DMT constellations. This method reduces 
the peak power by 3-6 dB with no loss of data rate and with 
virtually identical symbol error rate as conventional systems 
when no clipping is applied to the signal. When clipping is 
applied, it provides at least 2 dB of peak-power reduction. 

1 Introduction 

In a discrete multitone (DMT) system, the output time signal, 
x (£), is generated by 

x (t) = Re (X k ) • cos (2irf k t) + Im (X k ) - sin (2irf k t) (1) 

k 

for 0 < k < M where the X k are constellation points from 
QAM constellations of roughly equal average energy 1 and 
f k = -jk^ . The sampled version of the signal can be efficiently 
computed by an fast Fourier transform. 

However, when all the X k take on similar values, the con- 
stellation points interfere constructively to produce a large 
time-sample magnitude. In fact, the time-domain peak power 
grows linearly as the number of frequency bands. Thus, the 
time samples may occasionally have very high output levels, 
which leads to the requirement of an expensive, highly linear, 
and power-inefficient analog front end (AFE) and/or a clip- 
ping mechanism to limit the time-sample magnitude, which 
leads to impulsive noise and performance degradation. High 

*This work is supported by a Tellabs fellowship at the University of 
Illinois. 

1 Without fine power adjustment used in actual DMT systems, the 
average energies of these QAM constellations are exactly equal. 



peak-to-average ratio (PAR) is arguably the greatest draw- 
back of DMT. In order to maintain an acceptable level of 
system reliability, the output signal is usually clipped at a 
high PAR which leads to higher power consumption. In fact, 
the AFE consumes a substantial amount of power for a typical 
DSL modem. This power consumption requirement may lead 
to various difficulties in system designs. For example, a USB 
DSL modem may need an extra power supply when the total 
power consumption of the modem exceeds the amount that 
the bus may supply. Also, the telephone company imposes a 
maximum spatial density of power consumption in the cen- 
tral office (CO). Lower power consumption allows more DSL 
modems to be more densely in the CO which leads to lower 
rental cost. 

Several authors [1], [2] exploit some unused bandwidth in 
the DMT system to cancel out the large-amplitude samples 
generated by the other information-bearing channels. This 
approach requires frequency spectrum that could otherwise 
be used for transmission of information. One class of methods 
clips the transmitted signal intelligently so that the degrada- 
tion due to the impulsive noise is minimized, or the receiver 
uses a priori knowledge of the clipping mechanism to decode 
the signal reliably [3], [1]. However, this class of methods de- 
grades the performance of the overall system by introducing 
artificial noise. 

Numerous methods have been proposed to reduce the PAR 
of another multicarrier modulation scheme - orthogonal fre- 
quency division multiplexing (OFDM). Currently, most of 
the existing schemes approach this problem by modifying the 
transmitted signal. Various block coding schemes have been 
used [4], [5], [6]. These schemes lower the overall data rate 
of the system and essentially trade bandwidth for lower peak 
amplitude. Others [7], [8] relax the constellation by allowing 
possibly more than one choice for each constellation point. 
The signals generated by these methods may occasionally vi- 
olate the power constraint. Finally, there are methods which 
adjust the phase of the constellation points in order to avoid 
phase alignments that lead to large magnitude time samples 
[9], [10]. In addition, [11] points out that evaluating PAR 
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reduction techniques in the digital domain often leads to op- 
timistic values. At times, over 3 dB are of PAR reduction are 
fictitious. A phase rotation scheme with over 4 dB of PAR 
reduction in the discrete-time domain was shown (by [11]) to 
have less than 1 dB of reduction when the performance mea- 
surement is done in the continuous-time domain. We refer to 
this phenomenon as peak regrowth because the peak seems to 
reappear after going from the discrete-time to the continuous- 
time domain. 

Unfortunately, most of these schemes cannot be easily ap- 
ply to DMT. Most coding schemes require PSK in each tone. 
Phase rotation do not perform well as the per-tone constella- 
tion become large. In addition, many of these schemes assume 
identical and fixed constellation in each tone. DMT modem 
designs its constellation during the initialization according to 
the SNR of the subbands. Most in-band signal-modification 
schemes cannot be initialized in such short time. 

In this paper, we propose a method for peak-power reduc- 
tion in DMT systems based on constellation shaping based 
on the shell mapping used in the V.34 modems. It shapes 
the constellation into a hypersphere in order to reduces the 
peak, without lowering the data rate or increasing the symbol 
error rate. It yields up to 6 dB of peak-power reduction. We 
will show theoretically that the proposed scheme is immune 
to peak regrowth. For practical channel parameters with clip- 
ping, it yields about 2 dB of peak-power reduction. 



2 Constellation Shaping 

For ordinary DMT systems, we may consider the QAM con- 
stellations from all channels jointly as a real-valued 2N-D 
lattice in the frequency domain, and we choose a bound- 
ary in the 2N-D space such that all lattice points inside the 
boundary are considered to belong to the constellation. Typi- 
cally, the boundary is square or rectangular in nature because 
the input bits can be easily divided into subgroups, each of 
which indexes one frequency dimension. However, when a 
constellation-dependent metric needs to be minimized, a rect- 
angular constellation is often suboptimal [12]. 2 Instead, a 
properly-chosen boundary may lead to better performance in 
terms of the particular metric without lowering the data rate 
or increasing the symbol error rate. In the V.34 modem stan- 
dard, the purpose is to reduce the average power. In our case, 
the goal is to minimize the peak power, or the co-norm, Hxjl^, 
in the time-domain. We illustrate this idea with an example. 

Example 1 Suppose we use a 2-point inverse DFT to cre- 
ate a two-channel real-valued DMT signal. The modulation 



matrix is given by: 



2 In general, constellation shaping is a preceding technique in which 
a proper boundary for a multi-dimensional constellation is selected to 
match a certain metric of interest. See [12] for details. 
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A 16-point, square constellation (see Figure 1) is used for the 
unshaped baseline case. (Note that this is not a QAM constel- 
lation, but a 2-D joint real-valued cross-constellation of fre- 
quencies 1 and 2.) //X = [ Xi Xi ] is a constellation point 
belonging to the constellation C, we wish to find the quan- 
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min 
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^ax||AivX|| ( 



■) = ^C € ^ w| «)- (Note 

that A N (C) - {x : x = A N X for VX G C}.) One choice that 
leads to better performance is shown in Figure 2. In Fig- 
ures 3 and 4, the corresponding time-domain sample vector, 
x = [ xi x 2 ], for each constellation point is plotted. The 
dotted lines indicate the equi-metric line for the maximum 
time-sample magnitude for the unshaped and shaped case. The 
overall PAR reduction is 1.5 dB. As we will see later, for 
higher dimensions, the PAR reduction is larger. 



>^6 




Figure 1: The unshaped, 2-D constellation. 

The constellation boundary is usually determined by the 
metric that we want to optimize. In this paper, we propose a 
hyper-spherical boundary. 

3 Hyper-Spherical Shaping 

Spherically-shaped constellations are used in the ITU V.34 
modem standard to reduce the average power of the constel- 
lation while neither lowering the data rate nor increasing the 
symbol error rate. For the peak-power reduction problem, 
a spherical constellation is also desirable because a spherical 
volume is invariant under rotation and reflection. Therefore, 
the radius of the spherical boundary is the maximum time- 
sample magnitude when a unitary transformation is applied 
to the constellation points. (Otherwise, we must scale by the 
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Figure 2: The shaped 2-D constellation in the 2-D real fre- Figure 4: The time samples generated by the shaped constel- 
quency domain.. lation. Via shaping, the largest peak amplitude is reduced to 

1.25. 



the number of transmitting tones. In order words, the 
maximum magnitude of the complex envelope of the signal 
is bounded by the /2-norm of the constellation point. If we 
limit this upper-bound to a maximum value of /?, we have a 
constellation with each point, X, satisfying 




Figure 3: The time samples generated by the unshaped con- 
stellation. Note that the worst-case amplitude is 1.5. 

matrix norm.) By choosing a spherical boundary, the peak 
time sample will be no larger than the peak magnitude of 
the per-channel base constellation. In other words, a hyper- 
spherical constellation is "amplification free" under a unitary 
transformation. 

In the time-domain, we may bound the magnitude of the 
continuous-time signal, x(t), using the Cauchy-Schwartz in- 
equality 



|x(i)| = 



^ Re (Xfc) • cos (27r/fct) + Im (X k ) ■ sin (27r/ fc t) 



= Kx ) d t >|<||x|| 2 ||d t || 2 = ^||x|| 2 



(3) 
(4) 



where X = ( •-■ ReXfc •-• ImX* ••• ), d t = 
( •-■ cos(27r/ fc t) ••• sin (27r/ fc i) •-• ), and N is 



|x(i)|<^||X|| 2 </3 



(5) 



or all constellation points must be inside a 27V-D hypersphere 
of radius 

3.1 Asymptotic performance 

In order to evaluate the theoretical performance of this 
scheme, we use the fact that the volume of an iV-D hyper- 
sphere with a radius of p is given by [13] 



vol(S N ) = 



7T 2 



r(f + 



(6) 



For simplicity, we ignore the fine power adjustment used to 
tune the bit error rate of each individual tone to the desired 
level. We assume that there are N square, QAM constella- 
tions each with unit variance. When the constellation size 
is large, we may apply the continuous approximation [12] 
and treat each QAM constellation as uniform random vari- 
able over a square region with an edge of >/6. The maximum 
magnitude of the unshaped constellation is given by 3 
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£ fc Re(X*)-cos(27r/*<) + 
Im (Xfc)sin (2irf k t) 



^cos(fc0)+sin (k6) 



(7) 

(8) 
(9) 



3 Experimentally, it can be shown that M uns haped ~ 1.6JV 
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Since we require that the data rate of the shaped system re- 
mains the same, the identical data-rate requirement is equiva- 
lent to requiring both the unshaped hypercube and the shaped 
hypersphere to have the same volume in the iV-D space. 
Therefore, we compute the radius of a 2N-D sphere with the 
volume of U 2N . This yields the equation 



e N = 



7T 



N 



r(jv+i)' 



n 2N 



(10) 



or the radius of the shaped constellation is given by 



P=M shaped = ^[T(N + l)\™ (11) 
Thus, the overall peak-power reduction, G sp h ere (iV), is 
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As N increases, we approach asymptotically 



(12) 

(13) 
(14) 
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(15) 
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N->ooV 2(JV + 1) 
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Therefore, the peak-power reduction approaches asymptoti- 
cally to ^/f, or 6.304 dB. 4 

4 Results 
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The peak-power reduction of the hyper-spherical shaping un- 
der various numbers of channels and constellation sizes (iden- 
tical constellation for each tone) are shown in Figure 5. We 5 Conclusion 
can see that, for large constellations, it approaches the the- 
oretical limit, G sphere (N), rather rapidly. The peak-power 
reduction for typical constellation sizes and numbers of chan- 
nels is about 3 to 6 dB. 



Figure 5: The peak-power reduction vs. the number of chan- 
nels for several different constellation sizes. 



Next, we use two channels that one may encounter in an 
actual ADSL. Figure 6 shows the bit allocation for the 31- 
channel and 215-channel cases. The amount of peak-power 
reduction is 4.794 dB for the 31-channel case and 5.224 dB 
for the 215-channel case. 

In practice, we can tolerate a small amount of clipping as 
long as it does not increase BER significantly and the signal 
is compliant with the PSD mask. The ANSI ADSL standard 
recommends a clipping rate lower than 10 -7 . To prevent fre- 
quent CRC frame errors, a value as small as 10~ 10 is used. 
This requirement allows the AFE to clip the signal at least 5.2 
times of the rms value of the signal. Thus, with clipping, the 
PAR of a DMT system is upper bounded at 14.3 dB. With 
an upper limit of 6 dB of peak reduction, any systems with 
an unshaped PAR of over 20 dB may achieve lower PAR by 
clipping instead of shaping. However, we may clip the shaped 
system. When we apply clipping to the shaped constella- 
tion, the amount of peak-power reduction decreases. Figure 7 
shows the clipping probability curve. JYom the curve, we see 
that the amount of peak-power reduction at a clipping rate of 
1CT 8 is about 2.1 dB for the 31-tone case and 2.0 dB for the 
215-tone case. As the clipping rate decreases asymptotically 
to 0, the amount of peak-power reduction approaches those 
without clipping (4.79 and 5.22 dB). 



4 If the experimental value for M uns h a ped is used instead, the asymp- 
totical gain is 5.67 dB. 



We have introduced the constellation shaping approach in the 
peak-power reduction problem for DMT systems and design 
a spherically shaped constellation. As the size of the con- 
stellation for each channel becomes large, the peak-power re- 
duction achievable by these constellations is about 3-6 dB 
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Figure 6: The bit allocation schemes used by the two simu- 
lated channels. 



without clipping. However, with clipping, the gain is reduced 
to about 2 dB. The hyper-spherical is one example of shaped 
constellations and are by no means optimal for the peak-power 
reduction. Recent discovery has shown that other choices of 
boundary may lead to even more reduction of peak power at 
the expense of higher computational complexity. [14], [15] 

There are certain drawbacks of the shaping-based peak- 
power reduction methods proposed here. First, they require 
relatively large constellation size (and thus high data rate) to 
obtain significant reduction, which may limit their application 
to shorter loops in the DSL application. However, most DSL 
systems are designed with high data rate in mind. More- 
over, we may use a shaped constellation with more points 
than required and apply coding to both reduce the data rate 
and increase the reliability of the system simultaneously. A 
second concern may be that the channels of the shaped con- 
stellation become correlated. We can apply coding on top of 
the shaped constellation to partially remedy this problem, as 
convolutional codes are applied on top of the 16-D shaped 
constellation in V.34 modems. Coding for DMT systems re- 
mains a topic for future research. 
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Abstract 

In this paper, we introduce a novel scheme using a 
constellation shaping approach to reduce the peak-to- 
average ratio (PAR) in orthogonal frequency-division 
multiplexing (OFDM) systems. In time domain, the 
peak power bound traces out a hypercube boundary. 
We map this square time-domain boundary back to 
the frequency domain via the inverse DFT and con- 
struct a method for indexing the OFDM constellation 
points. The encoding and decoding of the constella- 
tion use generators and relations from group theory. 
The end result is a coding scheme with nearly 20 dB 
of PAR reduction with no reduction in data rate or 
performance. 

1 Introduction 

In an orthogonal frequency division multiplex 
(OFDM) system, the output time samples are gener- 
ated by the inverse FFT of the constellation points. 
For the baseband case, 

x n =J^Y k -e-^ nk (1) 

fc=0 

for n = 0,1,..., JV and A; = 0,1,...,JV- 1. OFDM 
systems can operate either in baseband (as in the 
ADSL standard) or in passband; we examine only the 
baseband case here, although the method applies to 
both variations. We restrict x = [ xo • • • xyv-i ] 

IP be real. This allows us to define X = 
ReY 0 ReYk ImYi ImV^ 1 and 

An as columns of sin(27r^) and cos(27r^), and 
we have: 

x = A N X (2) 

In an ordinary OFDM system, {Yk)k=o ls a set °* 
i.i.d. discrete random variables uniformly distributed 



over a QAM constellation, C2. By the central limit 
theorem, as N becomes large the set of time samples, 
{ x n}n^o becomes a set of i.i.d. Gaussian random 
variables with variance increasing linearly with N. 
Thus, the time samples may occasionally have very 
high output levels, which leads to the requirement of 
an expensive, highly linear, and power-inefficient ana- 
log front end (AFE) and/or a clipping mechanism to 
limit the time sample magnitude, which leads to im- 
pulsive noise and performance degradation [1]. High 
PAR is arguably the greatest drawback of OFDM. 

Numerous methods have been proposed to reduce 
the PAR of OFDM. Currently, most of the exist- 
ing schemes approach this problem by modifying the 
transmitted signal. One class of methods clips the 
transmitted signal intelligently so that the degrada- 
tion due to the impulsive noise is minimized, or the 
receiver can use a priori knowledge of the clipping 
mechanism to decode the signal reliably [2], [3]. How- 
ever, this class of methods degrades the performance 
of the overall system by introducing artificial noise 
into the system. Various block coding schemes have 
been used [4], [5], [6]. These schemes lower the overall 
data rate of the system and essentially trades band- 
width for lower peak amplitude. Another class of 
methods [7] tries to utilize some unused bandwidth in 
the OFDM system to cancel out the large-amplitude 
samples generated by the other information-bearing 
channels. This method requires frequency spectrum 
that could otherwise be used for transmission of in- 
formation. Finally, there are methods which adjust 
the phase of the constellation points in order to avoid 
phase alignment that leads to large magnitude time 
samples [8]. 

In this paper, we propose a method for peak am- 
plitude reduction in OFDM systems based on con- 
stellation shaping. The peak amplitude of the time 
samples are measured by the 00 norm in the time 
domain. We map this hypercube boundary back to 
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the frequency domain. The hypercube boundary be- 
comes an N-D parallelogram. To encode and decode 
the constellation points inside this new boundary, we 
use group theory to compute the generators for in- 
dexing these points. This method can provide nearly 
20 dB of PAR reduction in practical situations while 
maintaining equivalent data rate and performance. 
In addition, it can be combined with other existing 
methods to further increase the PAR reduction. 



2 Constellation Shaping 

For ordinary OFDM systems, we have an N-D lattice 
in the frequency domain, and we choose a boundary 
in the N-D space such that all lattice points inside the 
boundary are considered to belong to the constella- 
tion. Typically, the boundary is square or rectangular 
in nature because the input bits can be easily divided 
into subgroups each of which indexes one frequency 
dimension. However, when a constellation-dependent 
metric needs to be minimized, a rectangular constel- 
lation may not be the best choice [9], [10]. 1 Instead, 
a properly-chosen boundary may lead to better per- 
formance in terms of the particular metric without 
lowering the data rate or increasing the symbol error 
rate. We illustrate this idea with an example. In our 
case, the goal is to minimize the peak power, or the 
oo-norm, Hx^, in the time-domain. 

Example 1 Suppose we use a 2-point inverse 
DFT to create a two-channel real-valued OFDM sig- 
nal. The modulation matrix is given by: 



A 2 



<i-\) 



(3) 



tity min 
c 



(x|c" AjvX " c 



) =i rC^ C ) ||x|| ~)- 



max 

x€AAr(C) ' 

(Note that A N (C) = {x : x = A^X for VX € C}.) 
One choice that leads to better performance is shown 
in figure 2. In figure 3 and 4, the corresponding time- 
domain sample vector, x = [ X\ x 2 ] , for each con- 
stellation point is plotted. The dotted lines indicate 
the equi-metric line for the maximum time-sample 



1 Id general, constellation shaping is a precoding technique 
in which a proper boundary for a multi-dimensional constella- 
tion is selected to match a certain metric of interest. See [9], 
[10] for details. 



magnitude for the unshaped and shaped case. The 
overall PAR reduction ts 1.5 dB. As we will see later, 
for higher dimensions, the PAR reduction is much 
larger. 




Figure 1: The unshaped, square, 2-D 16-point con- 
stellation. (4-PAM for each dimension.) 



2 o 



A 16-point, square constellation (see Figure 1) 
is used for the unshaped baseline case. (Note 
that this is not a QAM constellation, but a 2-D 
joint cross-constellation of frequency 1 and 2.) If 
X = [ X\ X2 ] is a constellation point belonging 
to the constellation C, we wish to find the quan- 



.0" o. 



0 

X1 



Figure 2: The shaped 2-D 16-point constellation. 



The constellation boundary is usually determined 
by the metric that we want to optimize. In the prob- 
lem of PAR reduction in OFDM systems, we use the 
oo-norm, H H^, in the time domain. This metric 
traces out a square boundary, defined by Hx)^ = /?, 
in the time domain. In the frequency domain, we get: 



2 




Figure 3: The time samples generated by the un- 
shaped constellation. Note that the worst-case am- 
plitude is 1.5. 




Figure 4: The time samples generated by the shaped 
constellation. Via shaping, the largest peak ampli- 
tude is reduced to 1.25. 



ll x lloo = H A * x lloo = This is an N-D parallelo- 
gram in the frequency domain denned by A^ 1 . 

3 Group Construction of the 
Shaped Constellation 

In the previous section, we observed that for the real- 
ified, inverse DFT transformation, A^, there exists 
a constellation that matches the transformation in 
such a way that maxxec IIA^X^ is minimized. In 
this section, we introduce a systematic way to ob- 
tain these constellation points. This algorithm uses 
relations and generators on a free abelian group to 
construct a quotient group which turns out to be the 
constellation points we need (aside from a transla- 
tion.) 

First, we recall some definitions [11]. 

Definition 1 An abelian group F is free abelian if 

there exists a subset X C F of elements of infinite 
order, called a basis of F, such that F = Ylxex ( x ) 
where {8*} is a cyclic group with the generator of 6i* 
In other words, F is isomophic to Z N . 

Definition 2 An abelian group G has generators X 
and relations A if G ~ F/R, where F is the free 
abelian group with basis X — (ui,U2, . . . ,uyv}, A = 
{£i,$2j • • • >$m} is a set of Z-linear combinations of 
elements of X, and R is the subgroup of F generated 
by A, or 

R = (6i)®(6 2 )®-~®(6 M ) (4) 
where 0 is the direct sum of two subgroups. 

In our construction, given an AT-D linear trans- 
formation A at, we take F to be the integer lattice, 
Z N . Obviously, F is free abelian and the basis are 
X = {ei ,e<2, . . . ,e N } where e* has zero elements ex- 
cept for the i th element which has a value of 1. These 
vectors are the natural basis for an AT-D vector space. 

Theorem 1 (Rotman, Corollary 11.2) Every 
group G is a quotient group of a free group. 

Next, we use the relations to specify the boundary 
of the group. A relation, 5, is a Z-linear combina- 
tion of elements of X that is equal to the identity 
element; in other words, 6 € F and 6 = 0. The set 
of relations for a specific A^ 1 define the edges of the 
parallelogram. An AT-D parallelogram is determined 
by N distinct edges. Recall that A^ 1 maps the opti- 
mal time-domain boundary to the optimal frequency- 
domain boundary. In the time domain, each corner 



3 



of the hypercube boundary may be represented by 
an N-D binary vector, c. In the frequency domain, 
the corners become A^c. All 2^ corners can de- 
rive from any TV corners. Thus, we may use c = 
{ei , e2, . . . , e N }. Then, each column of A^ 1 defines a 
edge of the optimal N-D parallelogram boundary and 
a relation in the quotient group F/R. If the columns 
of A^ 1 have non-integer elements, we simply round 
them to the nearest integers. 

It may be possible that the rounded version of A^ 1 , 
denoted as [A^ 1 ] , may not have enough constellation 
points in it. To solve this, we simply scale A^ 1 by 
a factor a and round it again to get A = [a • A^ 1 ] . 
We choose the smallest a such that |det ([a; • A^ 1 ]) | 
is greater than or equal to the number of constella- 
tion points. Note that each column of A specifies a 
relation in F since each corner of the parallelogram 
belongs to the same equivalent class, specifically, the 
equivalent class [0]. For this reason, we now refer to 
A as the relation matrix. 

Example 2 We use the previous example. First, 



*-(ur-oi) 



(5) 



Since |det(A 2 1 )| = 2 < 16, we need to choose a 



*2 

bigger a. For a - 3, |det ([a • A 2 = 18. In 
Figure 5, we draw out the free group F, the subgroup 
R generated by A, and the quotient group (or the 
shaped constellation.) Note that we did not use two 
points in the previous example because we needed only 
16 constellation points. 

We now have the quotient group F/R, where R 
is the group generated by A. However, the order of 
this group, in general, is very large - making tabula- 
tion impossible. Instead, we should study the relation 
matrix A via row and column operations [12], This 
procedure is very well-known and well understood in 
the area of abstract algebra. 

Theorem 2 (Armstrong, Ch. 21-22) Any rela- 
tion matrix A can be decomposed into A = UDV, 
where D is diagonal with entries {c^}^, where the 
columns of TJ are the generators, and V is the trans- 
formation between the coordinates in terms of X and 
the canonical coordinates. 2 



3.1 Encoding Process 

We now have a set of generators {u X) u 2 , . . . , u N } and 
their corresponding orders {oi,<72, • • • ,^iv}- The en- 
coding process involves mapping an integer J between 

2 For a more comprehensive treatment, please refer to [12]. 



o 

X1 



Figure 5: The free group F is the lattice depicted 
by the set of dots. The group R generated by the 
relations is the set of circles. The quotient group 
F/R is the set of diamonds. Note the dotted Ones 
illustrate the cosets of F/R. 



0 and \C\ — 1 to an N-D integer vector. This vector 
corresponds to the AT-channel OFDM constellation 
point. First, we turn the input integer into a canon- 
ical coordinate A = (Ai, A 2 , . . . , \n) by iteratively 
computing 



Xi = / mod a* 



(6) 



for i = 1, 2, . . . , AT. Next, simply use the generators 
to compute the final constellation point: 3 

X = [Ami] + [A2W2] + ■ • • + [Xnun] (7) 

The peak-limited transmitted signal is then x = 
DFT- 1 (X), exactly as in conventional OFDM. 

3.2 Decoding Process 

Decoding a received OFDM constellation vector X 
is even simpler. First, compute X from the received 
time sample vector x. In the presence of noise, we 
simply round each elements of X to the nearest inte- 
ger. Then, form 



A = X r V = [ Ai A 2 — Ayv ] 



(8) 



3 Note that the addition is done in the quotient group F/R. 
Therefore, we are actually adding equivalent classes. Follow- 
ing abstract algebra convention, the addtion of two equivalent 
classes, [a] and [b] is defined as: [a] + [6] = [a + 6]. See [13] for 
details. 
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Next, for i = 1, 2, . . . , AT, compute 

Xi = Atmoddi (9) 
Finally, the original input is recovered by 
I = Ai + <ti (a 2 + <x 2 (• • - ((jyv-i ■**)•••)) (10) 

Due to limited space, we state without proof the fol- 
lowing theorem. 

Theorem 3 The encoding and decoding processes 
can each be done in O (AT 2 ) . 

To conclude this section, we present some results 
using this algorithm. Note that mean power of the 
shaped constellation is slightly larger (typically about 
5%.) However, PAR reduction via increasing the av- 
erage power seems to obfuscate the performance mea- 
surement; thus, we approximate the average power of 
the two constellations to be the same. Figure 6 shows 
the total amount of peak-power reduction using this 
constellation shaping with various numbers of chan- 
nels and constellation sizes. We see that reduction of 
over 25 dB is possible when the constellation size is 
large. Even with a typical constellation size, a PAR 
reduction of over 15 dB is easily realized. 

4 Conclusion 

We have demonstrated the effectiveness of the con- 
stellation shaping approach in the PAR reduction 
problem for OFDM systems. As the size of the 
PAM constellation for each channel becomes large, 



the peak-power factor reduction is of the order of 
0(N), where N is the number of channels. Thus, 
it asymptotically provides an unlimited amount of 
PAR reduction as the number of channels and the 
constellation size increase arbitrarily. 

In addition, this approach can provide large PAR 
reduction without lowering the data rate or increas- 
ing symbol error rate. All clipping schemes introduce 
some artificial noise into the system; thus, they in- 
directly increase the signal-to-noise ratio. All coding 
schemes lower the overall data rate. Even the selec- 
tive mapping scheme robs some of the transmission 
bandwidth for sending information regarding the se- 
lected phase rotation. In order for the performance to 
increase, the data rate must be sacrificed in these sys- 
tems. The constellation shaping approach preserves 
the data rate and minimum distance between neigh- 
boring constellation points and yet provides a large 
PAR reduction. Another advantage of the shaping 
method is that it is complementary to existing meth- 
ods. It is possible to combine this method with others 
to further reduce the PAR of the constellation. Fig- 
ure 7 depicts one possible configuration where we use 
the methods of [8], [7], [3] to further reduce the PAR. 
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ABSTRACT 



Bandwidth-limited high-rate digital communica- 
tions systems use large constellations such as 
M-ary QAM, M € {64,128,256,512,1024}. These 
square or cross-shaped constellations, though easy 
to implement, do not pack the symbols together as 
efficiently as possible, and besides are symmetric 
when the symbols are equiprobable and indepen- 
dent, so that no channel information is available 
from blind third-order statistics of the channel 
outputs. We develop high-order "optimum" [1] 
hexagonal constellations, some of which are natu- 
rally asymmetric, and all of which can be modified 
to be asymmetric without exceeding the peak and 
average power levels of the corresponding M-ary 
QAM constellations. 



order statistical estimates may converge more quickly 
than their fourth-order kin, and lower-order blind equal- 
ization algorithms can be much simpler than higher-order 
algorithms. 

Here we investigate high-order constellations whose sym- 
bols are on hexagonal lattices. These constellations have 
been proposed in the past as "minimum error" alphabets, 
having an asymptotic advantage over QAM- type constel- 
lations of 0.8 dB in the average power needed to assure a 
particular symbol separation [1], [2], [4]. From our per- 
spective,, these constellations are interesting as alterna- 
tives that are either naturally asymmetric or that have 
some leeway - i.e. 0.8 dB - for introduction of a con- 
trolled amount of asymmetry. We find the "optimum" 
M-ary hexagonal constellations for M = 128, 256, 512, 
and 1024 and comment on methods for enhancing their 
level of asymmetry. 



INTRODUCTION 



When bandwidth is limited and data rates are high, digi- 
tal communications systems employ large constellations of 
closely-packed symbols. M-ary QAM constellations with 
M = 64, 128, 256, 512, and 1024 satisfy the requirement 
of sending 6, 7, 8, 9, or 10 bits per symbol, assuming that 
the transmitted symbols are equiprobable and indepen- 
dent. M-ary QAM constellations are simple to construct 
using a pair of identical in-phase and quadrature modula- 
tors, and the symbol decision map is readily defined by a 
small set of comparators. On the other hand, QAM- type 
constellations are known to be suboptimum in terms of 
symbol separation for a given average transmitted sym- 
bol power [2], and they are symmetric. 

A constellation is symmetric if one or more complex rota- 
tions by 0 r € (0, 2?r) result in a constellation with identical 
symbol values and probabilities. Assuming equiprobable 
i.i.d. generation of symbols from a QAM constellation, 
there are three such rotation angles. When the receiver 
is blind, it is not possible to generate an absolute phase 
reference, necessitating differential encoding for proper re- 
covery of the data. Also, the third-order statistics of sym- 
metric constellations contain no information about the 
channel. When there is inter-symbol interference (ISI) 
that is potentially mixed-phase in nature, statistics of at 
least fourth order are necessary to guarantee optimum 
channel identification or equalizer tap settings [3]. 

An asymmetric constellation allows a blind absolute phase 
estimate at the receiver and enables blind equalization of 
mixed-phase channels using third-order statistics. Third- 



SYSTEM MODEL 

The transmitted symbol sequence {z[n]} consists of i.i.d. 
symbols from the constellation S. The receiver knows 
the values of the symbols and their probabilities. There 
are no pilot tones or pilot symbols, so the receiver must 
process the channel output sequence using only the 

knowledge of the constellation statistics. 

y[n] = ^ hkX l n ~ *] + ™M W 
k=ki 

The channel output in (1) contains both 'linear ISI 
- represented by the channel coefficients , . . . , A-i, 
h+\ , . . . , hk 2 - and additive noise. The noise sequence 
{w;[n]} is often treated as being white and Gaussian. Even 
when it is not Gaussian, it may be symmetric, so that 
third-order statistics of the noise may fall out of the equal- 
ization equations. 

The receiver produces an estimate x[n] of the symbol 
x[n\. The final decision as to the transmitted symbol is 
that x € S nearest to x[n]. If the channel coefficients 
are known, x[n] can be the output of a feed-forward lin- 
ear filter or of a decision-feedback structure [5]. When 
the channel coefficients are not known, they may be esti- 
mated, subsequent to which filter taps are derived, or the 
tap values may be generated directly. 

A measure of asymmetry is 

7 = E{\x[n)\ 2 X [n)}, (2) 



which is a third-order statistic. If we apply a similar 
statistic to (1), 

F ttm = E{y[n]y*[n + t]y[n + m]} (3) 

we find that 

Ft,m = 7 ^2 h * h k+thk+m (4) 

with symmetric white noise. Information about the am- 
plitude and the phase of the channel can be obtained from 
a suitable set of F £ , m statistics. 

When undertaking blind equalization, each F t , m will be 
estimated using a sequence of channel outputs. A larger 
value of Ft tm is preferable to a smaller one, as an error 
in the estimate is then relatively smaller. The constel- 
lation design may help or hinder timely convergence of 
the statistical estimates via the direct scaling of the F t , m 
statistics by 7. 



HEXAGONAL CONSTELLATIONS 



Hexagonal constellations appear in [1] and [2] as poten- 
tial solutions to the problem of finding the constellation 
that minimizes the average transmitted power given a re- 
quired minimum symbol separation. The hexagonal con- 
stellations considered had 8, 16, 32, and 64 symbols, and 
analysis of hexagonal and square-lattice (e.g. QAM) con- 
stellations showed that hexagonal constellations have a 
0.8 dB advantage over QAM constellations with identical 
minimum symbol separation. 

"Optimum" hexagonal constellations with 128, 256, 512, 
or 1024 symbols have been of little interest even though 
QAM constellations of these sizes have been used in high- 
rate bandwidth-limited communications systems. A prin- 
cipal reason for this is the simplicity of symbol generation 
and decision-making with QAM constellations. However, 
analogous structures for hexagonal constellations are no 
longer much of a problem due to advances in digital and 
mixed signal processing technologies. A main reason for 
renewed interest in hexagonal constellations is the po- 
tential for asymmetry at no cost in average transmitted 
power beyond that of QAM constellations currently in 
use. Indeed, depending on the amount of desired asym- 
metry, up to 0.8 dB of power can be saved. 



A. Construction Method 

For purposes of comparison, all of the constellation mea- 
surements will assume a unit minimum symbol separa- 
tion. The symbols of the constructed constellations oc- 
cupy points of a hexagonal lattice, while those of the QAM 
constellations occupy points of a square lattice. 

We will require that the hexagonal constellations have 
no DC component. This is a common feature that aids 
in spectral shaping [5]. Were there a DC component, it 
would act as a sequence of pilot symbols added to the 
transmitted data symbols and could be used by the re- 
ceiver accordingly. 

The first step in the construction process is to list the 
points on a unit-spaced hexagonal lattice within a cer- 




Fig. 1. Points on a Hexagonal Grid 



tain radius of some central point. Figure 1 shows several 
points on such a lattice. The gray triangle shown in the 
figure depicts a region of space in which the center of the 
constellation may be located relative to the lattice points. 
All locations outside the gray triangle are equivalent to 
locations within it due to the symmetric nature of the 
lattice. Note that there are three points that will pos- 
sibly result in a symmetric constellation. These are the 
three vertices of the triangle. A hexagonal constellation 
centered on the upper vertex may exhibit 2-fold, 3-fold, or 
6-fold symmetry. A constellation centered on the left ver- 
tex may exhibit 2- fold symmetry, and on the right vertex 
3-fold symmetry. 

From a given starting point, we take the M lowest-power 
symbols in the list as potential members - less their mean 
- of the A/-ary "optimum" hexagonal constellation. The 
mean provides a new starting point. Using this point, 
we select the M lowest-power symbols and form a zero- 
mean constellation. If the minimum distance between 
each point in one constellation and the points in the other 
is zero (or close enough to zero, since we carry out the 
computations on a computer), the process ends. Oth- 
erwise, it repeats until such time as successive potential 
constellations match. We begin the search for the con- 
stellations on each of the vertices, and choose the lowest- 
average-power results as the "optimum" constellations. 

As a test, we used this procedure to find the "optimum" 
M-ary hexagonal constellations for M = 4, 7, 8, 11, 16, 19, 
and 32 and found that the constructed constellations 
agreed with those of the literature [1], [2]. 



B. 64- Point Constellations 

The first high-order hexagonal constellation we examined 
is an "optimum" 64-HEX, shown in Fig. 2 [1]. This con- 
stellation is symmetric, since a rotation by 7r preserves 
the symbol values. The starting point for construction 
of this algorithm was the upper vertex of the gray trian- 
gle in Fig. 1. The average power of this constellation is 
Pavg = 8.8125 and the peak power is P pcak = 16.125. 

When the starting point is instead the left vertex, the 
suboptimal 64-HEX- SO constellation of Fig. 3 is gener- 
ated. The average power of this constellation is P avg = 
8.8254 - slightly greater than that of the 64-HEX - but 
7 = -0.1424 + 0.0237.7. 

A 64-QAM constellation has P a vg = 10.5, P pC ak — 24.5, 
and 7 = 0. In terms of average power, hexagonal constel- 
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Fig. 2. Symmetric "Optimum" 64-HEX Constellation 



Fig. 4. "Optimum" 128-HEX Constellation 
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Fig. 3. Asymmetric Suboptimal 64-HEX-SO Constellation 



Fig. 5. "Optimum" 256-HEX Constellation 



lations have a 0.75 dB advantage. In terms of peak power, 
the 64-HEX advantage is 1.65 dB while the 64-HEX-SO 
advantage is 1.4 dB. 



E. 512-Point Constellation 



C. 128-Point Constellation 



The "optimum" 128-point constellation appears in Fig. 4. 
For this constellation, 7 = -0.2044, P avg = 17.6366, and 
Ppeak — 34.519. The corresponding statistical quantities 
of the cross-shaped 128- QAM constellation are 7 = 0, 
Pavg = 20.5, and P pea k = 42.5. In dB, the average and 
peak power of 128-HEX show improvements of 0.65 and 
0.9 compared with the power measurements of 128- QAM. 



D. 256-Point Constellation 



Figure 5 shows the "optimum" 256-HEX constellation 
with 7 = -0.1662 + 0.0569j, P avg - 35.2536, and P peak = 
71.77. For 256-QAM P avg = 42.5, P pea k = 112.5, and of 
course 7 = 0. The average and peak power advantages of 
256-HEX are 0.81 dB and 1.95 dB. 

The 256-HEX constellation is beginning to look somewhat 
circular in shape, as opposed to the hexagonal constella- 
tions with fewer symbols, which were distinctly hexagonal. 
The 256-QAM constellation is simply a large square of 16 
symbols by 16 symbols. 



Figure 6 depicts an "optimum" 512-HEX constellation. 
The geometry of the constellation is quite circular - a 
"best" packing of the symbols into the smallest possi- 
ble area. The statistics for 512-HEX are 7 = -0.06209, 
Pavg = 70.53, and P pcak = 143.43. For 512-QAM, an- 
other cross-shaped constellation, the statistics are 7 = 0, 
Pavg = 82.5, and P peak = 188.5. 

Though 512-HEX shows 0.68 dB and 1.19 dB P a v 9 and 
Ppeak improvement compared to 512-QAM, and has a 
non-zero 7, it is clear numerically and to the naked eye 
that 512-HEX is only weakly asymmetric. In order to en- 
able blind equalization in a reasonable amount of time, 
some of the power improvement should be traded off dur- 
ing constellation design for an increased 7. 



F. 1024-Point Constellation 



The final "optimum" hexagonal constellation is the 1024- 
HEX constellation of Fig. 7. 1024-HEX has 7 = 0.12146, 
P avg = 141.13, and P pea k = 279.9. 1024-QAM has 7 = 0, 
Pavg = 170.5, and P pca k = 480.5. By switching from 
1024-QAM to 1024-HEX, it is possible to reduce the P avg 
by 0.82 dB and P pea k by 2.35 dB. 



; 0 1 , o o o o o o o o o o 

oo;Ooooo|ooooo;66 
00090000900000000 

: •■'ooooio o o o oia o o o oio 1 o o o" i 

■ 0000600006000060000 ■ 

:-o o-o o o-'o o 0-0 O;0 o 00000000; 

000000000000000000000 
0;p . 0.0 .0 0 p o o 0.0:0 ,0 o 0.9:9 .9. 9 9..9i9. 
06000060000600006000060 

o o|o 000 o;o 000 o;o 000 o;o 000 o;o o 

0 0 0 0 0 0 '9 0 0 0 0 9 0 0 00000 o o o o 

o 0:0 o-o o 0:0 000 0:0 000 0:0 000 0:0 o 
•0 0 0 0 0 0 0 0 0 0 0 o 0 0 0 0 0 -6 o 0 0 0 0 0 •• 
o 0.0 000 o-o 000 ojo 000 o : o 000 ojo O 

o .0.0. o. o .0 .9 .0.0. o. o .0 .0.0. 0.0.0 .0 .0. o. 0..9 .0 .. 

o 0:0 000 0:0 000 0:0 000 0:0 000 0:0 o 
06000060000600006000060 

6Vd '6 "6 !' 6' 6-'6 0 6 6' o' : o 6" 6 do' 60' 6' 6 '<'6\ 6 

900009000090000900009 

io o o-o oio O O 0 0:0' 0"0 O 0:0' o o 0 0: ■ 

j 0000600006000060000 ': 

J....O .0.0.0-.0..0 O.O.OiO. O ,Q O.Ojo. O..O .O.. ; 

00090000000009000 
o 0:0 000 0:0 000 0:0 o 

: 6 b 6 6 6 6 i d 6 6 '6 6 i 



-10 



TABLE I 

Constellation Comparisons Given Unit Minimum Symbol 
Separation 



Constellation 


bl 


Paverage 


Ppeak 


RA HAM 

64-HEX 

04-llDA-iJV 


n 
u 

0 


8.8125 

8 89^4 
o.o*u*± 


16.75 

17 7^7 
11.10/ 


128-QAM 
128-HEX 


0 

0.2044 


20.5 
17 6366 


42.5 
34.519 


256-QAM 
256-HEX 


0 

0.17606 


42.5 
35.254 


112.5 
71.77 


512-QAM 
512-HEX 


0 

0.06209 


82.5 
70.53 


188.5 
143.43 


1024-QAM 
1024-HEX 


0 

0.12146 


170.5 
141.13 


480.5 
279.9 



Fig. 6. "Optimum" 512-HEX Constellation 
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Fig. 7. "Optimum" 1024-HEX Constellation 



low hexagonal constellations to be used in some channels 
where QAM counterparts are considered impractical. 



CONCLUSION 



High-order hexagonal constellations having M = 64, 128, 
256, 512, or 1024 symbols meet a minimum symbol sepa- 
ration requirement using 0.8 dB less average transmitted 
power than M-axy QAM constellations. So-called "opti- 
mum" hexagonal constellations [2] for these large values of 
M were found and their second- and third-order statistical 
properties tabulated. The average and/or peak transmit- 
ted power advantages of these constellations over QAM 
constellations can be traded off for enhanced asymmetry. 
This allows blind equalization of ISI-corrupted symbol se- 
quences using third-order statistics rather than statistics 
of higher order. 



G. Practical Issues 



We have proposed M-ary hexagonal constellations as re- 
placements for M-ary QAM constellations. The transi- 
tion entails use of transmitters and receivers with mod- 
estly higher complexity. Benefits include power savings 
and asymmetry. If desired, some of the power savings can 
be spent on improving the asymmetry properties of the 
constellation, again with only a minor increase in symbol 
generation and decision-making complexity. 

One issue we have not discussed is phase jitter [6]. A 
small phase error moves low-power symbols a small dis- 
tance in the complex plane, and high-power symbols a 
large distance. The symbol decision regions are effectively 
the same size for most of the symbols in the high order 
hexagonal constellations, which may bar their use in a 
jitter-susceptible channel. If this is the case, a system 
design should include a power- dependent angular phase 
separation, which would make the decision regions much 
less uniform in shape and the design process less simple 
than that presented here. 

Another difficulty is that the symbols are quite closely 
packed in the hexagonal constellations (also in the high 
order QAM constellations), so that there is little room 
for error in equalization. A small amount of residual ISI 
can lead to an exorbitantly-high symbol error rate. We 
hope that the low-complexity third-order blind algorithms 
whose use is enabled by constellation asymmetry will al- 
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